



The  research  reported  in  this  document  has  been  supported  in  part  by  the 
AERONAUTICAL  RESEARCH  LABORATORY 
of  the  OFFICE  OF  AEROSPACE  RESEARCH,  UNITED  STATES  AIR  FORCE 


through its European Office


ASTIA  DOCUMENT  NO:  AD 
CONTRACT AF 61(052)-351


TN  No.  67 


TECHNICAL  NOTE 


BAND  THEORY,  VALENCE  BOND  AND 
TIGHT-BINDING CALCULATIONS*


by 


Per-Olov Löwdin


Quantum  Chemistry  Group 

For  Research  in  Atomic,  Molecular  and  Solid-State  Theory 
Uppsala University, Uppsala, Sweden


Invited  paper  presented  at  the  International 
Conference on Chemical Physics of Nonmetallic
Crystals, August 28 - 31, 1961, at
Northwestern University, Evanston, Illinois.


October 15, 1961


The  research  reported  in  this  document 
has  been  sponsored  in  part  by  the 
CHEMISTRY  RESEARCH  BRANCH,  ARL,  AFRD, 
of  the  AIR  RESEARCH  AND  DEVELOPMENT  COMMAND, 
UNITED  STATES  AIR  FORCE, 
through  its  European  Office. 


CONTENTS 


page 

1.  Introduction  1

2.  Fundaments of Band Theory

    (a)  Hartree-Fock Approximation  2
    (b)  Translational Symmetry  7
    (c)  Calculations of Band Structures  15
    (d)  Shortcomings of Band Theory; Correlation Error  24

3.  Valence Bond Method

    (a)  Covalent Bond; Valence Bond Functions  31
    (b)  Dirac-Van Vleck Vector Model  36
    (c)  Extension of Valence-Bond Method  39

4.  Tight-Binding Approximation

    (a)  Basic Problems  40
    (b)  Recent Applications  51
    (c)  Virial Theorem in Theory of Ionic Crystals  53

5.  Extension of Band Theory; Different Orbitals for Different Spins  56

6.  General Self-Consistent-Field Theory and Exact Solution to Many-Electron Problem  68

7.  Concluding Remarks  82


ERRATA 


p. 4, reference 8: J.C. Slater, Phys. Rev. 51, 846 (1937).
p. 7, 3 lines below eq. (13): $0 \le \mu_\nu \le G-1$.
p. 8, eq. (16): In the exponential factor read "$2\pi i\,\mathbf{k}\cdot\mathbf{m}$".
p. 9, line 11 from the bottom: Read "... fact, that the eigenvalues $\epsilon(\mathbf{k})$ ...".
p. 28, 5 lines below reference 63: Read "... approach the correct value for $R = \infty$. The general ...".
p. 31, reference 70: W. Heitler and F. London, Z. Physik 44, 455 (1927).
p. 45, 3 lines above eq. (66): Read "... whereas the orthonormality condition ^ = 1 leads to ...".
p. 71, 1 line above eq. (103): Read "... satisfying $(H-E)\Psi = 0$, one has ...".


I 


ABSTRACT 

In the theory of the electronic structure of crystals, the fundamental
features of the band theory, the valence bond method, and the tight-binding
approximation are reviewed. The band theory is studied on the basis of the
Hartree-Fock scheme, and the Bloch functions are formed by a projection
technique. The main methods for calculating Hartree-Fock functions in a
solid are briefly discussed. The advantages and disadvantages of the band
theory and the valence bond method are emphasised, and special attention is
paid to the correlation error.

In connection with the tight-binding approximation, the importance of
the continuum part and of the approximate linear dependencies is stressed.
It is shown that a complete orthonormal set of translationally connected
atomic orbitals may be constructed as a convenient basis for this approach.
The implication of the virial theorem in interpreting the cohesive properties
of the ionic crystals is further emphasised.

Some  recent  refinements  of  band  theory  are  then  discussed.  It  is 
shown  that  a  large  part  of  the  correlation  error  can  be  removed  by  permitting 
"different  orbitals  for  different  spins".  This  leads  to  a  scheme  intermediate 
between  band  theory  and  valence  bond  method  and,  by  means  of  a  single 
parameter,  one  can  obtain  an  essential  lowering  of  the  energy  curve  and  the 
correct  asymptotic  behaviour  for  separated  atoms  or  constituents.  This 
approach  may  be  generalized  to  an  extension  of  the  Hartree-Fock  scheme, 
where  the  total  wave  function  is  defined  as  a  projection  of  a  Slater  determinant. 

The band theory can be further refined and connected to the exact
solution of the many-electron Schrödinger equation of the crystal by means of
an extension of the self-consistent-field scheme, utilizing the so-called
reaction operator, here exactly defined by means of a simple partitioning technique.
The various types of self-consistent-field theories are finally compared.


-1- 


1.  INTRODUCTION

The quantum theory of the electronic structure of crystals has
historically been developed essentially along two main lines based on band
theory and valence bond method, respectively. Both approaches are to a
certain extent approximate, and the former seems to be more appropriate
for describing conductors and semi-conductors, whereas the latter seems
particularly convenient for studying insulators. Actually, both methods are
needed in order to understand the general properties of crystals and their
electric, magnetic, optical, cohesive, elastic, and thermal behaviour, and
the fundamental problem is then how they could be combined and refined to
give any accuracy desired.

In this survey, the recent progress in this field will be briefly
reviewed. The advantages and disadvantages of band theory and valence bond
method will be discussed, and the nature of the approximations and errors
involved will be investigated. Special attention is given to the so-called
tight-binding approximation, and the importance of the virial theorem in
interpreting energy results in crystal theory will be emphasized.

A simple generalization of band theory to include correlation effects
will be described. It will be shown that the main advantages of band theory
and valence bond method may be further enhanced and the disadvantages and
errors partly removed by a synthesis of the two ideas, which may be
characterized as a band theory with different orbitals for different spins.

The relation between band theory and the exact many-electron theory
of a crystal will be further studied. It will be shown that, in connection with
the exact description, there exists a one-electron model based on a general
self-consistent-field scheme which may be considered as an extension of
Brueckner's generalization of the Hartree-Fock approximation. This result
is obtained by means of the exact reaction operator, which is here derived
by a partitioning technique offering a simple and forceful alternative to the
otherwise used infinite-order perturbation theory.

In conclusion, the various approaches will be compared and discussed.
By means of density matrices, it will be shown that, independent of the way
one is solving the Schrödinger equation, certain aspects of the one-electron
band theory will be preserved also in the exact many-electron theory, for


-2- 


instance the concepts of reduced wave vector, effective mass, etc.

Since we are here mainly interested in the electronic structure of
crystals, we will throughout the entire paper assume that the nuclei are
fixed in the positions characteristic for the lattice under consideration, and
that the nuclear coordinates may be treated as parameters in the electronic
wave function (Born-Oppenheimer approximation).


2.  FUNDAMENTS  OF  BAND  THEORY 

(a)  Hartree-Fock Approximation

The band theory of crystals is physically built on the independent-particle
model, according to which each electron in a many-electron
system moves under the influence of the outer field and the "average" field
of all the other electrons 1). For each electron, there exists an effective
Hamiltonian $H_{\rm eff}$ and a Schrödinger equation of the form

$$ H_{\rm eff}(1)\,\psi_k(x_1) = \epsilon_k\,\psi_k(x_1), \tag{1} $$

where $\psi_k(x_1)$ is a spin-orbital, $x_1 = (\mathbf{r}_1, \zeta_1)$ is the space-spin coordinate
of electron 1, and $\epsilon_k$ the corresponding one-electron energy. In the
Hartree-Fock scheme 2) the total electronic wave function $\Psi$ is approximated by a
single Slater determinant:

$$ \Psi = (N!)^{-1/2} \det\{\psi_1, \psi_2, \ldots, \psi_N\}, \tag{2} $$

1)  N. Bohr, Proc. London Phys. Soc. 35, 296 (1923).

2)  D.R. Hartree, Proc. Cambridge Phil. Soc. 24, 89 (1928); V. Fock,
    Z. Physik 61, 126 (1930); J.C. Slater, Phys. Rev. 35, 210 (1930);
    P.A.M. Dirac, Proc. Cambridge Phil. Soc. 26, 376 (1930); 27, 240
    (1931).


-3-


where $\psi_1, \psi_2, \ldots, \psi_N$ are the occupied spin-orbitals, which are assumed to
form an orthonormal set. The effective Hamiltonian is represented by the
expression

$$ H_{\rm eff}(1) = \frac{p_1^2}{2m} - e^2 \sum_g \frac{Z_g}{r_{1g}} + e^2 \int dx_2\, \frac{\rho(x_2, x_2) - \rho(x_1, x_2)\,P_{12}}{r_{12}}, \tag{3} $$

where the first term is the kinetic energy, the second the attraction potential
between electron 1 and the nuclei g, whereas the last term is the
above-mentioned "average" potential from all the other electrons. The quantity $\rho$
is the Fock-Dirac density matrix:

$$ \rho(x_1, x_2) = \sum_{k=1}^{N} \psi_k(x_1)\,\psi_k^*(x_2), \tag{4} $$

which satisfies the basic relations $\rho^2 = \rho$, ${\rm Tr}(\rho) = N$. The operator $P_{12}$
is an exchange operator with respect to the electronic coordinates $x_1$ and
$x_2$, and the corresponding exchange potential has hence a non-local
character 3). The spin-orbital energies $\epsilon_k$ have a physical meaning in connection
with the first ionization potentials 4) and, to a certain extent, they may be used
also in studying the excitation energies 5).

3)  For the approximation of the exchange potential by a local potential,
    see J.C. Slater, Phys. Rev. 81, 385 (1951); V.W. Maslen, Proc.
    Phys. Soc. A69, 734 (1956); P.O. Löwdin, Phys. Rev. 97, 1474
    (1955); p. 1487 f.

4)  T. Koopmans, Physica 1, 104 (1933).

5)  See e.g. P.O. Löwdin, Phys. Rev. 97, 1490 (1955), and references
    there.

The Hartree-Fock equations (1) are a system of non-linear
integro-differential equations connected with an eigenvalue problem, which are solved
by the "self-consistent-field" (SCF) procedure. This may be indicated by the
diagram

$$ \rho \;\rightarrow\; H_{\rm eff} \;\rightarrow\; \psi_1, \psi_2, \ldots, \psi_N \;\rightarrow\; \rho \tag{5} $$


-4- 


and, after being started by an initial estimate of $\rho$ or $\psi_1, \psi_2, \ldots, \psi_N$, the cycle is
repeated until the procedure becomes "self-consistent", i.e. no further changes
occur in the significant figures when the cycle is repeated. The eigenvalue
problem (1) has in the atomic case been solved by numerical integration 6), and
this approach has also been applied to crystals in the cellular method 7) and
in the augmented plane wave method 8). The expansion method by Ritz 9) was
first applied to the SCF-procedure in connection with molecules 10), but later
this technique has proven to be very useful also in the cases of atoms and
crystals.
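The self-consistent cycle described above can be illustrated by a minimal sketch. It assumes a toy two-site lattice model with an on-site repulsion `U` standing in for the electron-electron term; the model, the function name `scf_closed_shell`, and the damping parameter `mix` are illustrative assumptions and do not come from the text.

```python
import numpy as np

def scf_closed_shell(h, U, n_pairs, mix=0.5, tol=1e-10, max_iter=500):
    """Toy cycle: density -> effective Hamiltonian -> orbitals -> density.

    h       : real symmetric one-electron matrix in a site basis (assumed model)
    U       : on-site repulsion, a stand-in for the electron-electron term
    n_pairs : number of doubly occupied orbitals
    """
    n_sites = h.shape[0]
    dens = np.full(n_sites, n_pairs / n_sites)       # initial estimate of the per-spin density
    for iteration in range(1, max_iter + 1):
        h_eff = h + U * np.diag(dens)                # "average" field of the opposite-spin electrons
        eps, c = np.linalg.eigh(h_eff)               # one-electron eigenvalue problem
        occ = c[:, :n_pairs]                         # occupy the lowest orbitals
        new_dens = np.sum(occ**2, axis=1)            # rebuild the density from the orbitals
        if np.max(np.abs(new_dens - dens)) < tol:    # no further change: self-consistent
            return eps, new_dens, iteration
        dens = mix * new_dens + (1.0 - mix) * dens   # damped update for stability
    raise RuntimeError("SCF cycle did not converge")

# Two sites with slightly different site energies, one electron pair.
h = np.array([[0.0, -1.0], [-1.0, 0.5]])
eps, dens, n_iter = scf_closed_shell(h, U=2.0, n_pairs=1)
print("orbital energies:", eps, "density:", dens, "iterations:", n_iter)
```

The damped update is one common way to stabilize the iteration; other mixing schemes would serve equally well in this sketch.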


6)  For a survey of the atomic SCF-calculations, see D.R. Hartree,
    Repts. Prog. Phys. 11, 113 (1948); "Calculation of Atomic Structures"
    (John Wiley and Sons, New York 1957); R.S. Knox, Solid-State Physics
    4, 413 (Academic Press, New York 1957); P.O. Löwdin, Proc. R.A.
    Welch Found. Conf. Chem. Res. II. Atomic Structure, 5 (1958).

7)  E. Wigner and F. Seitz, Phys. Rev. 43, 804 (1933); 46, 509 (1934).

8)  J.C. Slater, Phys. Rev. 51, 846 (1937); 92, 603 (1953).

9)  W. Ritz, J. reine angew. Math. 135, 1 (1909).

10) C.A. Coulson, Proc. Cambridge Phil. Soc. 34, 204 (1938).


The methods of molecular theory may, in principle, be applied also to
crystals, since the latter are nothing but molecules of an immense size
characterized by translational symmetry. If one chooses atomic orbitals
(AO's) as a basis in Ritz's method, the molecular orbitals (MO's) associated
with a specific Hamiltonian may be found by linear combinations of atomic
orbitals (LCAO) 11). In solid-state theory this approach was introduced by

11) F. Hund, Z. Physik 51, 759 (1928); 73, 1 (1931); R.S. Mulliken,
    Phys. Rev. 32, 186 (1928); 41, 49 (1932); J.E. Lennard-Jones, Trans.
    Faraday Soc. 25, 668 (1929). For a survey, see R.S. Mulliken, J.
    chim. phys. 46, 497, 675 (1949).


-5- 


Bloch 12), and it goes under the name of "tight-binding approximation". The
coefficients in the MO-LCAO expansions may be determined so that the
molecular orbitals become Hartree-Fock functions by an iteration procedure
analogous to (5) 13) and, since the total wave function is approximated by a
single Slater determinant or antisymmetrized product (ASP), the entire
approach is often denoted by the symbol ASP-MO-LCAO-SCF introduced
by Mulliken. Even direct methods for evaluating $\rho$ without the use of
$\{\psi_k\}$ have been developed 14).


12) F. Bloch, Z. Physik 52, 555 (1929); 57, 545 (1929).

13) C.C.J. Roothaan, Revs. Modern Phys. 23, 69 (1951).

14) R. McWeeny, Proc. Roy. Soc. (London) A235, 496 (1956); A237, 355
    (1956); Technical Note 61, Uppsala Quantum Chemistry Group (1961),
    (unpublished).
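With a non-orthogonal atomic-orbital basis, Ritz's method leads to a generalized eigenvalue problem $Hc = \epsilon Sc$, where $S$ is the overlap matrix. The following minimal sketch solves it for two equivalent orbitals by symmetric orthonormalization of the basis, which is one common choice and is assumed here purely for illustration. The numerical values of `alpha`, `beta`, and `s` are hypothetical.

```python
import numpy as np

def lcao_orbitals(H, S):
    """Solve H c = eps S c by transforming to the orthonormalized basis S^(-1/2)."""
    s_val, s_vec = np.linalg.eigh(S)
    s_inv_half = s_vec @ np.diag(s_val ** -0.5) @ s_vec.T   # S^(-1/2)
    eps, c_ortho = np.linalg.eigh(s_inv_half @ H @ s_inv_half)
    return eps, s_inv_half @ c_ortho                        # coefficients in the AO basis

# Two equivalent atomic orbitals: diagonal element alpha, off-diagonal beta, overlap s.
alpha, beta, s = -1.0, -0.5, 0.25
H = np.array([[alpha, beta], [beta, alpha]])
S = np.array([[1.0, s], [s, 1.0]])
eps, C = lcao_orbitals(H, S)
print(eps)   # analytic values: (alpha + beta)/(1 + s) and (alpha - beta)/(1 - s)
```

The columns of `C` are normalized with respect to the overlap, i.e. `C.T @ S @ C` is the unit matrix.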


The Hartree-Fock scheme may be considered as an approximate method
for solving the many-electron Schrödinger equation

$$ H\Psi = E\Psi, \tag{6} $$

where $\Psi = \Psi(x_1, x_2, \ldots, x_N)$ is the many-electron wave function subject to
the antisymmetry requirement $P\Psi = (-1)^p\,\Psi$ corresponding to Pauli's
exclusion principle. For a crystal with fixed nuclei, the total Hamiltonian has
the form:

$$ H = e^2 \sum_{g<h} \frac{Z_g Z_h}{r_{gh}} + \sum_i \frac{p_i^2}{2m} - e^2 \sum_i \sum_g \frac{Z_g}{r_{ig}} + e^2 \sum_{i<j} \frac{1}{r_{ij}}, \tag{7} $$


where  the  first  term  represents  the  nuclear  repulsion,  the  second  the  kinetic 
energy  of  the  electrons,  the  third  the  attraction  between  the  electrons  and  the 
nuclei,  and  the  fourth  the  mutual  electronic  repulsion.  Spin-coupling  terms 
are  easily  added. 

One  may  solve  the  eigenvalue  problem  (6)  by  means  of  the  variation 


-6- 


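principle $\delta\langle H\rangle = 0$. If the total wave function is approximated by a
single Slater determinant, this leads to the Hartree-Fock equations (1) with
an effective Hamiltonian given by (3). For the ground state, the corresponding
total energy $E_{\rm HF}$ is an upper bound to the true eigenvalue $E$,
and the energy error $(E - E_{\rm HF})$, the "correlation energy", may be used as a
measure of the accuracy of the entire approach. It is hardly necessary to
emphasize that the Hartree-Fock energy is not identical with the sum of the
spin-orbital energies $\sum_k \epsilon_k$.

For the Hartree-Fock energy, one may use any one of the following three
formulas:

$$ E_{\rm HF} = V_{\rm nuc} + \sum_k \langle \psi_k | H_1 | \psi_k \rangle + \sum_{k<l} \langle \psi_k \psi_l | \frac{e^2}{r_{12}} (1 - P_{12}) | \psi_k \psi_l \rangle, \tag{9} $$

$$ E_{\rm HF} = V_{\rm nuc} + \sum_k \epsilon_k - \sum_{k<l} \langle \psi_k \psi_l | \frac{e^2}{r_{12}} (1 - P_{12}) | \psi_k \psi_l \rangle, \tag{10} $$

$$ E_{\rm HF} = V_{\rm nuc} + \frac{1}{2} \sum_k \left\{ \epsilon_k + \langle \psi_k | H_1 | \psi_k \rangle \right\}, \tag{11} $$

where $V_{\rm nuc}$ denotes the nuclear repulsion term in (7), $H_1$ the one-electron
part (kinetic energy and nuclear attraction) of (3), and
the last form is simply the arithmetic mean of the two first relations.
We note that, for crystals, one has to include the nuclear repulsion term in
the calculations, since otherwise $E_{\rm HF}$ will become divergent, i.e. no longer
proportional to the volume of the crystal 15).

15) P.O. Löwdin, Advances in Physics 5, 1 (1956), p. 11 f.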
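The statement that the Hartree-Fock energy differs from the sum of the spin-orbital energies, and that the three energy expressions agree, can be checked numerically. The sketch below uses randomly generated toy integrals and trial orbitals that are not self-consistent, and takes each $\epsilon_k$ as the expectation value of the effective Hamiltonian, for which the identity holds for any orthonormal set. The function name `hf_energy_forms` and all numerical data are illustrative assumptions.

```python
import numpy as np

def hf_energy_forms(h, g, c_occ, e_nuc=0.0):
    """Return three equivalent energy expressions and the orbital-energy sum.

    h[p, q]       : one-electron matrix elements <p|H_1|q>
    g[p, q, r, s] : two-electron integrals <pq|e^2/r_12|rs> (toy numbers here)
    c_occ         : columns are the occupied orthonormal spin-orbitals
    e_nuc         : constant nuclear repulsion term
    """
    rho = c_occ @ c_occ.conj().T                             # density matrix
    coulomb = np.einsum("prqs,sr->pq", g, rho)
    exchange = np.einsum("prsq,sr->pq", g, rho)
    h_eff = h + coulomb - exchange                           # effective Hamiltonian
    sum_eps = np.trace(h_eff @ rho).real                     # sum of <k|H_eff|k>
    sum_h1 = np.trace(h @ rho).real                          # sum of <k|H_1|k>
    pair = 0.5 * np.trace((coulomb - exchange) @ rho).real   # sum over pairs k<l
    return (e_nuc + sum_h1 + pair,                # one-electron terms plus pair terms
            e_nuc + sum_eps - pair,               # orbital energies minus pair terms
            e_nuc + 0.5 * (sum_eps + sum_h1),     # arithmetic mean of the two
            sum_eps)

rng = np.random.default_rng(0)
n_basis, n_occ = 4, 2
a = rng.normal(size=(n_basis, n_basis))
h = 0.5 * (a + a.T)                                          # symmetric one-electron matrix
b = rng.normal(size=(3, n_basis, n_basis))
L = 0.5 * (b + b.transpose(0, 2, 1))
g = np.einsum("apr,aqs->pqrs", L, L)                         # integrals with the usual symmetries
_, c = np.linalg.eigh(h)                                     # trial orbitals (not self-consistent)
e_a, e_b, e_c, sum_eps = hf_energy_forms(h, g, c[:, :n_occ])
print(e_a, e_b, e_c, sum_eps)
```

The three energies coincide, while `sum_eps` differs from them by the pair term, which is counted twice in the sum of the orbital energies.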


-7-


(b)  Translational  Symmetry 

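An ideal crystal is characterized by the translational symmetry which
is basic for the understanding of its fundamental properties. Let $\mathbf{a}_1, \mathbf{a}_2, \mathbf{a}_3$
be the primitive translations of the ordinary lattice and $\mathbf{b}_1, \mathbf{b}_2, \mathbf{b}_3$ those of the
reciprocal lattice, so that $\mathbf{a}_k \cdot \mathbf{b}_l = \delta_{kl}$. The vectors
$\mathbf{m} = \mu_1 \mathbf{a}_1 + \mu_2 \mathbf{a}_2 + \mu_3 \mathbf{a}_3$,
where $(\mu_1, \mu_2, \mu_3)$ is a triple of integers, connect equivalent points in the
ordinary lattice, whereas the vectors $\mathbf{K} = i_1 \mathbf{b}_1 + i_2 \mathbf{b}_2 + i_3 \mathbf{b}_3$ for integer $(i_1, i_2, i_3)$
connect equivalent points in the reciprocal lattice. Let further $T_1, T_2, T_3$
be the translational operators connected with the primitive translations
$\mathbf{a}_1, \mathbf{a}_2, \mathbf{a}_3$, respectively, and defined by the relation

$$ T_\nu\, \psi(\mathbf{r}) = \psi(\mathbf{r} + \mathbf{a}_\nu). \tag{12} $$

For the operator $T(\mathbf{m})$ connected with the general translation $\mathbf{m}$ one
has $T(\mathbf{m}) = T_1^{\mu_1} T_2^{\mu_2} T_3^{\mu_3}$.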

The treatment of the translational symmetry is greatly simplified, if
one introduces the Born-v. Kármán boundary condition 16):

$$ \psi(\mathbf{r} + G\,\mathbf{a}_\nu) = \psi(\mathbf{r}), \tag{13} $$

16) M. Born and T. von Kármán, Physik. Z. 13, 297 (1912).



where G is a very large integer, which defines the periodically repeated
microcrystal. Each microcrystal contains $G^3$ lattice points characterized
by the triple $(\mu_1, \mu_2, \mu_3)$, and the inequality $0 \le \mu_\nu \le G-1$ defines a
convenient "ground domain" (G). It follows from (13) that $T_\nu^G = 1$; the three
translations will now be cyclic operators of order G having the eigenvalues
$\exp(2\pi i \kappa_\nu / G)$, where $\kappa_\nu$ is an integer. The associated eigenvalue problem
is now easily solved by a simple projection technique 17), which does not
require any use of group theory. It is shown that one may conveniently label

17) P.O. Löwdin, Phys. Rev. 97, 1509 (1955); p. 1512; Advances in
    Physics 5, 1 (1956), p. 56 f.
-8- 


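the simultaneous eigenfunctions to $T_1, T_2, T_3$ either by the triple of integers
$(\kappa_1, \kappa_2, \kappa_3)$ or by the reduced wave vector:

$$ \mathbf{k} = (\kappa_1 \mathbf{b}_1 + \kappa_2 \mathbf{b}_2 + \kappa_3 \mathbf{b}_3)/G, \tag{14} $$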

$$ -G/2 < \kappa_\nu \le G/2, \tag{15} $$

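where the inequality (15) defines a ground domain (G) containing $G^3$ points in
k-space. The eigenvalue relation may now be written in the form:

$$ T(\mathbf{m})\, \psi(\mathbf{k}, \mathbf{r}) = \psi(\mathbf{k}, \mathbf{r} + \mathbf{m}) = e^{2\pi i\, \mathbf{k}\cdot\mathbf{m}}\, \psi(\mathbf{k}, \mathbf{r}). \tag{16} $$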

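For $\mathbf{m}$ equal to the primitive translations, this gives the famous Bloch
condition. The corresponding eigenfunctions may be found by means of the
projection operators

$$ O_{\mathbf{k}} = G^{-3} \sum_{\mathbf{m}}^{(G)} e^{-2\pi i\, \mathbf{k}\cdot\mathbf{m}}\, T(\mathbf{m}), \tag{17} $$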


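which fulfil the following basic relations:

$$ O_{\mathbf{k}}^{\dagger} = O_{\mathbf{k}}, \qquad O_{\mathbf{k}}^2 = O_{\mathbf{k}}, \qquad O_{\mathbf{k}} O_{\mathbf{l}} = 0 \quad (\mathbf{k} \ne \mathbf{l}), \tag{18} $$

$$ T(\mathbf{m})\, O_{\mathbf{k}} = e^{2\pi i\, \mathbf{k}\cdot\mathbf{m}}\, O_{\mathbf{k}}. \tag{19} $$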


One has further the "resolution of the identity" $1 = \sum_{\mathbf{k}} O_{\mathbf{k}}$, which implies
that every trial function $\Phi(\mathbf{r})$ satisfying the periodicity condition (13) may
be resolved into Bloch components, i.e.

$$ \Phi = \sum_{\mathbf{k}} O_{\mathbf{k}} \Phi = \sum_{\mathbf{k}} \Phi_{\mathbf{k}}, \tag{20} $$

-9- 


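which are not only orthogonal but also non-interacting with respect to every
operator $\Omega$ which commutes with the translations $T_1, T_2, T_3$, according to
the general formulas

$$ \langle O_{\mathbf{k}} \Phi | O_{\mathbf{l}} \Phi \rangle = 0, \qquad \langle O_{\mathbf{k}} \Phi | \Omega | O_{\mathbf{l}} \Phi \rangle = 0, \tag{21} $$

for different reduced wave vectors ($\mathbf{k} \ne \mathbf{l}$). The fundamental relations
(17)-(21) are easily verified directly.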
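The projection technique can be verified numerically in one dimension. The sketch below represents the translation on a ring of `G` points as a cyclic shift matrix, builds the projection operators as sums over its powers, and resolves an arbitrary periodic trial function into Bloch components. The ring size, the random trial function, and the test operator `omega` are illustrative assumptions.

```python
import numpy as np

G = 6                                        # points in the one-dimensional "microcrystal"
T = np.roll(np.eye(G), 1, axis=1)            # (T psi)[x] = psi[(x + 1) mod G]

def projector(kappa):
    """O_kappa = (1/G) * sum over mu of exp(-2 pi i kappa mu / G) * T^mu."""
    terms = (np.exp(-2j * np.pi * kappa * mu / G) * np.linalg.matrix_power(T, mu)
             for mu in range(G))
    return sum(terms) / G

O = [projector(kappa) for kappa in range(G)]

rng = np.random.default_rng(0)
phi = rng.normal(size=G)                     # arbitrary periodic trial function
components = [O_k @ phi for O_k in O]        # its Bloch components

omega = T + T.T                              # an operator commuting with the translation
print(np.allclose(sum(components), phi))     # resolution of the identity
print(abs(np.vdot(components[1], components[2])))          # orthogonality
print(abs(np.vdot(components[1], omega @ components[2])))  # non-interaction
```

Each component satisfies the Bloch condition, since applying `T` to `O[kappa] @ phi` multiplies it by `exp(2j*pi*kappa/G)`.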


Band Structure; Brillouin Zones. - If the integer G characteristic for the
microcrystal is very large, the density of points (14) becomes so large that
the set may be considered as quasi-continuous. It becomes then possible to
replace a summation over k-space with a corresponding integral

$$ \sum_{\mathbf{k}} f(\mathbf{k}) \;\rightarrow\; V \int f(\mathbf{k})\, d\mathbf{k}, \tag{22} $$

where V is the volume of the microcrystal. This quantity enters the formula,
since each discrete point in k-space is associated with the volume $1/V$.


We will now consider the spin-orbital energies $\epsilon_k = \epsilon(\mathbf{k})$ as
functions of the quasi-continuous variable $\mathbf{k}$ over this ground domain. The
name "band theory" comes actually from the fact, that the eigenvalues $\epsilon(\mathbf{k})$
show a band structure with the levels situated in certain allowed ranges or
"bands" separated by forbidden regions or energy "gaps". The ground domain
has here been fixed by the inequality (15), but even other choices are possible
and may physically be more convenient.

In order to study the k-space as a whole, we will now introduce the
plane waves $\exp(2\pi i\, \mathbf{k}\cdot\mathbf{r})$, where $\mathbf{k}$ is a wave vector
defined by (14) but with no restriction on the integers $(\kappa_1, \kappa_2, \kappa_3)$. Each
k-value is equivalent to one and only one point within the ground
domain and, since $\exp(2\pi i\, \mathbf{K}\cdot\mathbf{m}) = 1$, equivalent
k-values are associated with the same translational eigenvalue. All points


-10- 


in k-space can hence be divided into $G^3$ sets of equivalent points, and
the points within each set may further be arranged linearly after some
physical quantity, say $|\mathbf{k}|^2$. Each k-value would then have its unique
place within each series, and ambiguities could occur only when two equivalent
points, $\mathbf{k}$ and $\mathbf{k}'$, would have the same absolute value:

$$ \mathbf{k}' = \mathbf{k} + \mathbf{K}, \qquad |\mathbf{k}|^2 = |\mathbf{k}'|^2. \tag{23} $$

These are the equations for the boundaries between the so-called Brillouin
zones 18); the first zone contains apparently all non-equivalent points having
the smallest value of $|\mathbf{k}|^2$, the second zone contains all non-equivalent
points having the second smallest value of $|\mathbf{k}|^2$, etc. If the points on the
boundaries are assigned to the zones in a proper way, each zone contains
exactly $G^3$ points with one and only one representative for every set of
equivalent points. All zones have the same volume and may be "mapped" on the
first Brillouin zone or on the ground domain defined by (15).

18) L. Brillouin, Compt. rend. 191, 198, 292 (1930); J. phys. radium (7),
    1, 377 (1930).
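The ordering of equivalent points described above can be sketched for a one-dimensional lattice, where the point $k = \kappa b / G$ is labelled by the integer $\kappa$ and equivalent points differ by multiples of `G`. Points on a zone boundary come in pairs of equal absolute value; the rule used here (the negative value is placed first) is only one possible assignment, and the function names are hypothetical.

```python
def zone_index(kappa, G):
    """Brillouin-zone number (1, 2, ...) of the point labelled by the integer kappa."""
    reach = abs(kappa) // G + 2
    residue = kappa % G
    # All equivalent points near the origin, arranged after |k|^2 (ties: negative first).
    series = sorted((residue + i * G for i in range(-reach, reach + 1)),
                    key=lambda q: (q * q, q))
    return series.index(kappa) + 1

def reduce_to_first_zone(kappa, G):
    """Representative of kappa inside the first zone, using the same tie rule."""
    residue = kappa % G
    return min((residue - G, residue), key=lambda q: (q * q, q))

G = 6
zones = {kappa: zone_index(kappa, G) for kappa in range(-3 * G, 3 * G + 1)}
counts = [sum(1 for z in zones.values() if z == n) for n in range(1, 6)]
print(counts)   # number of points found in each of the first five zones
```

With this assignment each of the first five zones contains exactly `G` points, one from every set of equivalent points, in line with the counting argument in the text.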

The relations (23) are in crystal physics known as the Laue conditions
for X-ray diffraction in lattices. The zone structure was introduced by
Brillouin in a study of the energy splitting of plane waves by means of a weak,
periodic potential, which he found caused discontinuities or "energy gaps"
at the zone boundaries. These have hence a simple physical meaning.
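The opening of an energy gap at a zone boundary by a weak periodic potential can be reproduced with a small plane-wave calculation. The sketch below assumes a one-dimensional potential $V(x) = 2 v_1 \cos(2\pi x / a)$, units with $\hbar^2/2m = 1$, and a truncated basis of plane waves $\exp[2\pi i (k + n/a) x]$; the potential, the units, and the function name `bands` are illustrative choices that do not come from the text.

```python
import numpy as np

def bands(k, v1, a=1.0, n_max=5):
    """Energy levels at wave number k for V(x) = 2*v1*cos(2*pi*x/a).

    Basis: plane waves exp(2*pi*i*(k + n/a)*x), n = -n_max..n_max,
    in units with hbar^2/2m = 1 (assumed for this sketch).
    """
    n = np.arange(-n_max, n_max + 1)
    kinetic = np.diag((2.0 * np.pi * (k + n / a)) ** 2)
    # The cosine potential couples plane waves whose index n differs by one.
    coupling = v1 * (np.eye(len(n), k=1) + np.eye(len(n), k=-1))
    return np.linalg.eigvalsh(kinetic + coupling)

v1 = 0.1
at_boundary = bands(0.5, v1)      # k = 1/(2a): boundary of the first zone
inside_zone = bands(0.25, v1)     # a point well inside the first zone
gap = at_boundary[1] - at_boundary[0]
print("gap at zone boundary:", gap, " compared with 2*v1 =", 2 * v1)
print("lowest level inside the zone:", inside_zone[0])
```

At the boundary the two degenerate plane waves are split by very nearly twice the Fourier coefficient of the potential, whereas inside the zone the lowest level stays close to the free-electron value.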

The band splitting through various types of periodic potentials has
been investigated in great detail in a series of special examples chosen so that
the corresponding eigenvalue problem could be exactly solved 19).

19) P.M. Morse, Phys. Rev. 35, 1310 (1930); R. de L. Kronig and
    W.G. Penney, Proc. Roy. Soc. (London) A130, 499 (1931);
    H.A. Kramers, Physica 2, 483 (1935); J.C. Slater, Phys. Rev. 87,
    807 (1952); F.L. Scarf, Phys. Rev. 112, 1137 (1958); and others.


-11- 


In the following, we will concentrate our interest on the consequences
of the translational symmetry in the Hartree-Fock scheme, and it is then
convenient to consider $\epsilon = \epsilon(\mathbf{k})$ as a multi-valued function of the
reduced wave vector $\mathbf{k}$ over the first Brillouin zone or over the ground
domain (G).


Translations as Constants of Motion. - It is important to observe the
difference between a crystal problem based on the assumption of a fixed
periodic potential like the previously mentioned models 19) and the
Hartree-Fock scheme, where the potential in the effective Hamiltonian (3) depends
on the solutions to the eigenvalue problem (1). The latter problem is of a
non-linear nature and considerably more complicated. It can be approached by
considering the N-electron operator ($\nu = 1, 2, 3$):

$$ \mathcal{T}_\nu = T_\nu(1)\, T_\nu(2) \cdots T_\nu(N), \tag{24} $$

which corresponds to a primitive translation $\mathbf{a}_\nu$ of all electronic coordinates,
i.e. to a translation of the electronic cloud as a whole. Since

$$ \mathcal{T}_\nu H = H \mathcal{T}_\nu \tag{25} $$

for the many-electron Hamiltonian (7), the total translation is a normal
constant of motion to the many-electron system. This theorem may seem
trivial, but it is actually of fundamental importance in both the
one-electron approximation and the exact theory.

It is easily shown that $\mathcal{T}_\nu$ is another cyclic operator of order G,
and its eigenvalues and eigenfunctions may hence be derived in the same way
as before: see equations (12)-(21). The eigenfunctions may be labelled by
means of a total reduced wave vector $\mathbf{k}$ of type (14), restricted to $G^3$
different values by the inequality (15). These eigenfunctions fulfil the
generalized Bloch condition


-12- 


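$$ \mathcal{T}(\mathbf{m})\, \Psi(x_1, x_2, \ldots, x_N) = e^{2\pi i\, \mathbf{k}\cdot\mathbf{m}}\, \Psi(x_1, x_2, \ldots, x_N), \tag{26} $$

where $\mathcal{T}(\mathbf{m}) = \mathcal{T}_1^{\mu_1} \mathcal{T}_2^{\mu_2} \mathcal{T}_3^{\mu_3}$ means a translation of all electronic
coordinates a vector $\mathbf{m}$. The associated projection operators

$$ O_{\mathbf{k}} = G^{-3} \sum_{\mathbf{m}}^{(G)} e^{-2\pi i\, \mathbf{k}\cdot\mathbf{m}}\, \mathcal{T}(\mathbf{m}), \tag{27} $$

satisfying the identity $1 = \sum_{\mathbf{k}} O_{\mathbf{k}}$, may be used to resolve any arbitrary
many-electron function $\Psi(x_1, x_2, \ldots, x_N)$ into components

$$ \Psi = \sum_{\mathbf{k}} O_{\mathbf{k}} \Psi = \sum_{\mathbf{k}} \Psi_{\mathbf{k}}, \tag{28} $$

which are eigenfunctions to the total translations $\mathcal{T}_\nu$. Because of the
general relations

$$ \langle O_{\mathbf{k}} \Psi | O_{\mathbf{l}} \Psi \rangle = 0, \qquad \langle O_{\mathbf{k}} \Psi | H | O_{\mathbf{l}} \Psi \rangle = 0 \qquad (\mathbf{k} \ne \mathbf{l}), \tag{29} $$

these components are orthogonal and non-interacting with respect to the total
Hamiltonian H.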

In the following, we can concentrate our interest to a study of the
simultaneous eigenfunctions to H and $\mathcal{T}_\nu$. From the Schrödinger equation
$H\Psi = E\Psi$ follows that $H(\mathcal{T}_\nu \Psi) = E(\mathcal{T}_\nu \Psi)$ and, for a non-degenerate energy
level, it is then evident that $\mathcal{T}_\nu \Psi = {\rm const.}\, \Psi$, i.e. $\Psi$ is also an
eigenfunction to $\mathcal{T}_\nu$. For a degenerate level, we consider instead the resolution
of an arbitrary eigenfunction into Bloch components according to (28), and it
follows directly that each non-vanishing component is a simultaneous
eigenfunction to H and $\mathcal{T}_\nu$. Since $\mathcal{T}_\nu$ is symmetric in all coordinates, the
antisymmetry properties of the wave function will not be influenced by the
projection (27).

In the Hartree-Fock approximation, we will now require that the total
wave function represented by the single Slater determinant (2) should be an
exact eigenfunction to the total translations $\mathcal{T}_\nu$ ($\nu = 1, 2, 3$). This is simply


-13- 


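accomplished by choosing the one-electron functions $\psi_1, \psi_2, \ldots, \psi_N$ as eigenfunctions
to the one-electron translations $T_\nu$, and one obtains

$$ \mathbf{k} = (\mathbf{k}_1 + \mathbf{k}_2 + \cdots + \mathbf{k}_N)_G, \tag{30} $$
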

where the index G means that one should take the reduced wave vector within
the ground domain. The question is now whether such a choice always can be
made, i.e. whether it follows from the requirement that the determinant (2)
should be an eigenfunction to the total translations $\mathcal{T}_\nu$ that, except for an
arbitrary unitary transformation, it is necessary that the basic spin-orbitals
$\psi_1, \psi_2, \ldots, \psi_N$ are Bloch functions satisfying the relation (16). A careful
analysis of the problem shows that this is actually the case.

It seems rather natural to assume that the requirement that the basic
spin-orbitals are Bloch functions also should be self-consistent in the sense
of the Hartree-Fock scheme. From (4) and (16), it follows that

$$ \rho(x_1 + \mathbf{m},\, x_2 + \mathbf{m}) = \rho(x_1, x_2), \tag{31} $$

where $x + \mathbf{m}$ denotes the electronic coordinate $(\mathbf{r} + \mathbf{m}, \zeta)$, and this
relation implies that the electronic density has the periodicity of the lattice.
Equation (31) is easily derived from the condition that the total wave function
should be an eigenfunction to the total translations and is valid for the
first-order density matrix in general. The density matrix $\rho$ is the crucial
quantity in the effective Hamiltonian (3) and by means of (31), one can now
prove the relation

$$ T_\nu H_{\rm eff} = H_{\rm eff}\, T_\nu. \tag{32} $$



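The first terms in $H_{\rm eff}$ are easily handled, and only the exchange potential
with its non-local character requires more careful treatment. However, if
$\varphi(x_1)$ is an arbitrary function of $x_1$, one obtains, by shifting the integration
variable $x_2$ by $\mathbf{a}_\nu$ and using (31),

$$ T_\nu \int dx_2\, \frac{\rho(x_1, x_2)}{r_{12}}\, \varphi(x_2) = \int dx_2\, \frac{\rho(x_1 + \mathbf{a}_\nu,\, x_2 + \mathbf{a}_\nu)}{r_{12}}\, \varphi(x_2 + \mathbf{a}_\nu) = \int dx_2\, \frac{\rho(x_1, x_2)}{r_{12}}\, T_\nu \varphi(x_2), $$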


-14- 


which proves that also the exchange term commutes with the primitive
translations. Hence, the entire effective Hamiltonian $H_{\rm eff}$ commutes with $T_1$, $T_2$,
$T_3$, and the solutions to the eigenvalue problem (1) may then be chosen as
simultaneous eigenfunctions to all these operators. For a crystal, the basic
requirement that the Hartree-Fock functions $\psi_1, \psi_2, \ldots, \psi_N$ should be Bloch
functions is thus self-consistent.


Each one of the $G^3$ points in the k-space defined by (14) is
independent in the sense that the associated Bloch functions are not only
orthogonal but also non-interacting with respect to the effective Hamiltonian
$H_{\rm eff}$, as soon as $\rho$ satisfies (31). In forming $\rho$ according to (4), one
should sum over all occupied spin-orbitals which are then associated with a
certain distribution of points in k-space. The boundary of these occupied
points defines the Fermi surface associated with the system and state under
consideration.


Crystal Symmetry in General. - The translational symmetry has here been
treated by a simple projection operator technique which requires only the
knowledge of the translational eigenvalues following from the Born-v. Kármán
boundary condition (13), whereas no group theoretical information about the
system is needed. It is evident, however, that a still richer understanding of
this problem can be obtained by utilizing group theory to a full extent 20).

20) F. Seitz, Ann. Math. 37, 17 (1936); L.P. Bouckaert, R. Smoluchowski,
    and E. Wigner, Phys. Rev. 50, 58 (1936); C. Herring, Phys. Rev.
    52, 361, 365 (1937); and others.


-15- 


In addition to the translational symmetry, there are also other
symmetry properties of the different crystallographic point groups which may be
used for dividing the various symmetry functions into non-combining
classes 21). Even in this connection, the use of projection operator technique
has proven to be simple and forceful 22).

21) H.A. Bethe, Ann. Physik 3, 133 (1929); Bouckaert et al., Phys.
    Rev. 50, 58 (1936); F. Seitz, Phys. Rev. 47, 400 (1935); Z. Krist.
    94, 100 (1936); C. Herring, J. Franklin Inst. 233, 525 (1942);
    J.C. Slater and G.F. Koster, Phys. Rev. 94, 1498 (1954); and others.

22) M.A. Melvin, Revs. Modern Phys. 28, 18 (1956); H. McIntosh,
    Technical Note 21, Uppsala Quantum Chemistry Group 1958; J. Mol.
    Spectroscopy 5, 269 (1960).


(c)  Calculations  of  Band  Structures 


The main problem in the one-electron theory of crystals is the solution of the Hartree-Fock equations (1), which gives the spin-orbital energies as a multi-valued function over the first Brillouin zone or over the ground domain in the space of the reduced wave vector $\mathbf{k}$, and hence also the band structure. Since this is one of the key problems in the current solid-state theory, it is frequently reviewed, and for a detailed discussion of the progress in this field, we will refer to a series of survey articles 23). The recent papers by Herman 24) and by Pincherle 25) are particularly complete, and there is no reason to repeat the material contained in these articles. Here only a few additional remarks will be made, certain problems will be discussed from slightly different points of view, and some recently published papers will be listed and commented upon.


23) G.V. Raynor, Repts. Prog. Phys. 15, 173 (1952); J.R. Reitz, Solid State Physics 1, 1 (Academic Press, New York 1955); P.O. Löwdin, Advances in Physics 5, 1 (1956); J.C. Slater, Encyclopedia of Physics 19, 1 (Springer, Berlin 1956).

24) F. Herman, Revs. Modern Phys. 30, 102 (1958).

25) L. Pincherle, Repts. Prog. Phys. 23, 355 (1960).




The essential difficulty in the one-electron theory of crystals seems to be connected with the fact that the wave functions have atomic nature within the ion cores, whereas they behave as free waves in the regions between the atoms, and these properties are apparently hard to combine, at least practically.

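In Ritz's method 26), one expands the wave function in terms of a complete set $\{f_\mu\}$:

$$\psi = \sum_\mu f_\mu c_\mu , \tag{34}$$

where the problem is to determine the coefficients. It is convenient to introduce the energy matrix $\mathbf{H}$ with respect to the basis and the associated metric matrix $\boldsymbol{\Delta}$ having the elements:

$$H_{\mu\nu} = \int f_\mu^* H_{\mathrm{eff}} f_\nu \, dx , \qquad \Delta_{\mu\nu} = \int f_\mu^* f_\nu \, dx , \tag{35}$$

and the Schrödinger equation $H_{\mathrm{eff}}\psi = \epsilon\psi$ is then equivalent with the following system of linear equations:

$$\sum_\nu (H_{\mu\nu} - \epsilon\,\Delta_{\mu\nu})\, c_\nu = 0 , \tag{36}$$

with the secular equation $\det(H_{\mu\nu} - \epsilon\,\Delta_{\mu\nu}) = 0$.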
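The secular problem with a non-trivial metric matrix can be illustrated with a minimal sketch for two basis functions. The function name and the numerical matrix elements below are hypothetical and chosen only for illustration; they do not come from the text.

```python
import math

def solve_secular_2x2(H, D):
    """Roots of det(H - e*D) = 0 for real symmetric 2x2 matrices.

    H is the energy matrix and D the metric (overlap) matrix.
    Expanding the determinant gives a quadratic a*e**2 + b*e + c = 0.
    """
    a = D[0][0] * D[1][1] - D[0][1] ** 2
    b = -(H[0][0] * D[1][1] + H[1][1] * D[0][0] - 2.0 * H[0][1] * D[0][1])
    c = H[0][0] * H[1][1] - H[0][1] ** 2
    disc = math.sqrt(b * b - 4.0 * a * c)
    return sorted(((-b - disc) / (2.0 * a), (-b + disc) / (2.0 * a)))

# Hypothetical values: two equivalent functions with overlap S = 0.25.
alpha, beta, S = -1.0, -0.5, 0.25
energies = solve_secular_2x2([[alpha, beta], [beta, alpha]],
                             [[1.0, S], [S, 1.0]])
print(energies)
```

For this symmetric case the roots are (alpha + beta)/(1 + S) and (alpha - beta)/(1 - S), which shows how strongly the overlap enters the result.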


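The matrix problem (36) can be essentially simplified if one utilizes the existence of the translational symmetry. Since the wave functions should be Bloch functions $\psi(\mathbf{k},\mathbf{r})$, they are invariant against the corresponding Bloch projection (17), so that $O_{\mathbf{k}}\psi(\mathbf{k},\mathbf{r}) = \psi(\mathbf{k},\mathbf{r})$. Applying the operator $O_{\mathbf{k}}$ to both sides of (34), one obtains

$$\psi(\mathbf{k},\mathbf{r}) = \sum_\mu (O_{\mathbf{k}} f_\mu)\, c_\mu , \tag{37}$$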


which means that each Bloch function may be expanded in the associated Bloch projection of any complete set. The functions within the subset $\{O_{\mathbf{k}} f_\mu\}$ are usually not linearly independent, and an essential problem is to eliminate the redundancies in expansion (37) and replace it with a rapidly convergent series. This can, for instance, be done by an orthonormalization procedure 26), but


26) P.O. Löwdin, Adv. Chem. Phys. 2, 207 (Interscience, New York 1959), p. 288 f.


even other possibilities exist. Here we note that, by replacing the complete set $\{f_\mu\}$ by the $G^3$ subsets

$$\{O_{\mathbf{k}} f_\mu\} , \tag{38}$$

which are mutually orthogonal and non-interacting with respect to $H_{\mathrm{eff}}$, one obtains automatically a splitting of the secular equation (36) into $G^3$ independent parts, each one corresponding to a specific point $\mathbf{k}$ in the space of the reduced wave vector. This is an essential simplification of the problem which it is always possible to carry out.
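The splitting into independent parts can be checked numerically on a toy model. The sketch below assumes a one-dimensional ring of G cells with one localized function per cell and nearest-neighbour matrix elements only; the names `hamiltonian`, `bloch_sum`, and the parameter values are hypothetical and not taken from the text.

```python
import cmath
import math

G = 6                      # number of cells on the periodic ring (assumed)
ALPHA, BETA = 0.0, -1.0    # on-site and nearest-neighbour elements (assumed)

def hamiltonian(G, alpha, beta):
    """Energy matrix in the basis of localized functions, one per cell."""
    H = [[0.0] * G for _ in range(G)]
    for m in range(G):
        H[m][m] = alpha
        H[m][(m + 1) % G] = beta
        H[(m + 1) % G][m] = beta
    return H

def bloch_sum(G, k):
    """Coefficients of the Bloch projection of one localized function."""
    return [cmath.exp(2j * math.pi * k * m / G) / math.sqrt(G) for m in range(G)]

def matrix_element(u, H, v):
    n = len(u)
    return sum(u[i].conjugate() * H[i][j] * v[j] for i in range(n) for j in range(n))

H = hamiltonian(G, ALPHA, BETA)
blocks = [[matrix_element(bloch_sum(G, k1), H, bloch_sum(G, k2))
           for k2 in range(G)] for k1 in range(G)]

# Elements between different k vanish; the diagonal gives the band energy.
for k in range(G):
    print(k, round(blocks[k][k].real, 6))
```

With one function per cell every block is a single number, alpha + 2 beta cos(2 pi k / G); with several functions per cell each block would be a small secular problem of its own.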

The main problem in the application of the expansion method to crystal theory seems to be the choice of the subsets $\{O_{\mathbf{k}} f_\mu\}$ so that the convergency of the series (37) becomes as fast as possible 27). If the basic set $\{f_\mu\}$ is chosen to consist of plane waves (PW), the convergency will usually be very slow, since many waves will be needed to describe the inner atomic properties of the constituents of the crystal. In the method of orthogonalized plane waves (OPW) devised by Herring 28), the convergency is essentially


27) We note that, since the subsets are entirely independent, one may use different complete sets $\{f_\mu\}$, $\{f'_\mu\}$, ... or various adjustable parameters for different values of $\mathbf{k}$, which may often improve the convergency.

28) C. Herring, Phys. Rev. 57, 1169 (1940).


improved  by  choosing  a  basis  which  consists  of  the  Bloch  projections  of  the 
inner-core  atomic  orbitals  and  the  plane  waves  orthogonalized  towards  these 




functions. In applying this method to a practical problem, one has to remember that the inner-core Bloch functions and the OPW's are usually interacting with respect to the effective Hamiltonian, i.e. the corresponding matrix elements are not necessarily vanishing even if they may be small 29). As a practical tool, the method has been very forceful, and many important applications have been carried out; see Herman 24) and Pincherle 25).
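The orthogonalization step of the OPW idea can be sketched in one dimension. The example assumes a single Gaussian standing in for an inner-core orbital and a midpoint-rule integration grid; these choices, and all names below, are hypothetical and are not Herring's actual construction for a crystal.

```python
import cmath
import math

N = 400      # grid points in one cell (assumed)
L = 10.0     # cell length in arbitrary units (assumed)
DX = L / N
XS = [-L / 2 + L * (i + 0.5) / N for i in range(N)]

def inner(f, g):
    """Discretized scalar product <f|g> on the grid."""
    return sum(a.conjugate() * b for a, b in zip(f, g)) * DX

def normalized(f):
    norm = math.sqrt(inner(f, f).real)
    return [v / norm for v in f]

# Hypothetical stand-in for an inner-core orbital.
core = normalized([complex(math.exp(-4.0 * x * x)) for x in XS])

def plane_wave(n):
    k = 2.0 * math.pi * n / L
    return normalized([cmath.exp(1j * k * x) for x in XS])

def opw(n):
    """Plane wave with its core component projected out."""
    pw = plane_wave(n)
    c = inner(core, pw)
    return [p - c * q for p, q in zip(pw, core)]

print(abs(inner(core, plane_wave(1))), abs(inner(core, opw(1))))
```

The first printed number is the sizable overlap of the plain wave with the core function, the second is the vanishing overlap after orthogonalization. The sketch says nothing about matrix elements of the Hamiltonian between core functions and OPW's, which need not vanish.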


29) For critical studies of the method, see J. Callaway, Phys. Rev. 97, 933 (1955); V. Heine, Proc. Roy. Soc. (London) A240, 340, 354, 361 (1957); T.O. Woodruff, Solid State Physics 4, 367 (Academic Press, New York 1957).


From studies of the Knight shift, it has recently been observed that an OPW-calculation which gives good results e.g. with respect to cohesive and elastic properties or the band structure may not describe the regions around the nuclei very well, and particularly for the beryllium metal there seems to be a large discrepancy between theory and experiment in this respect 30). Of course, this is a consequence of the fact that the basic sets


30) L. Jansen (private communication).


are  truncated  in  all  applications,  and  that  the  "remainder  problem"  has  not 
been  investigated.  If  the  inner-core  Bloch  functions  chosen  are  not  particularly 
adapted  for  describing  the  nuclear  region,  one  has  certainly  to  introduce  a 
much  larger  number  of  OPW's  than  used  in  studying  other  properties  of  less 
local  type. 

A modification of the OPW-method has recently been suggested by Phillips and Kleinman 31), who start out from symmetrized combinations of plane waves instead of single waves; the method seems to work very well in the applications 32). In the OPW-approach, it may sometimes also be


31) J.C. Phillips and L. Kleinman, Phys. Rev. 116, 287 (1959).

32) L. Kleinman and J.C. Phillips, Phys. Rev. 116, 880 (1959), diamond; 117, 460 (1960), BN; 118, 1153 (1960), Si.




worthwhile to use flexible auxiliary functions instead of the fixed inner-core orbitals to speed up the convergency 33).


33) E. Brown and J.A. Krumhansl, Phys. Rev. 109, 31 (1958).


In Slater's method of augmented plane waves (APW) 34), the space


34) J.C. Slater, Phys. Rev. 51, 846 (1937); 92, 603 (1953); M.M. Saffren and J.C. Slater, Phys. Rev. 92, 1126 (1953); R.S. Leigh, Proc. Phys. Soc. (London) A69, 388 (1956).


around each atomic nucleus is divided into an inner sphere approximately corresponding to the ion core and an outer region, where plane waves are conveniently used. The Schrödinger equation (1) is solved in both regions with solutions of different character which are then joined smoothly on the boundary spheres. The method shows very good convergency properties, and a series of important applications to the problem of the band structure of various crystals has been carried out; see Herman 24) and Pincherle 25).

It has previously been mentioned here that the tight-binding method introduced by Bloch in crystal theory in its most refined form corresponds to the ASP-MO-LCAO-SCF-method in molecular theory. In the first applications, the method did not give any good results, since one neglected the overlap integrals between atomic orbitals on neighboring atoms. It later turned out that these overlap integrals were key quantities of essential importance for the entire theory 35). The non-orthogonality problem may be handled by starting from orthonormalized atomic orbitals 36) or from Wannier functions 37). A


35) R. Landshoff, Z. Physik 102, 201 (1936).

36) P.O. Löwdin, Arkiv Mat., Fys., Astr. 35A, No. 9 (1947); "A Theoretical Investigation into Some Properties of Ionic Crystals" (Thesis, Almqvist and Wiksell, Uppsala 1948); J. Chem. Phys. 18, 365 (1950).

37) G.H. Wannier, Phys. Rev. 52, 191 (1937).




more complete discussion of the tight-binding approach will be given in Sec. 4.

The Wannier functions are the Fourier transforms of the Bloch functions, and they form a complete set of mutually orthogonal functions localized around the lattice points and connected by translational symmetry. They form an excellent basis for investigating crystal properties, and one has tried to find direct methods for determining them; for references, see Herman 24) and Pincherle 25). Some important new results concerning the localization of the Wannier functions have recently been obtained 38). Functions intermediate between Bloch waves and Wannier functions have also been introduced 39).
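The statement that Wannier functions are lattice Fourier transforms of Bloch functions can be illustrated on the simplest possible model. The sketch assumes a one-dimensional ring of G cells with a single band whose Bloch vectors are pure phase factors over the cells; the names and the value of G are hypothetical.

```python
import cmath
import math

G = 8  # number of cells on the periodic ring (assumed)

def bloch(k):
    """Bloch vector for wave number k, as coefficients over the G cells."""
    return [cmath.exp(2j * math.pi * k * m / G) / math.sqrt(G) for m in range(G)]

def wannier(n):
    """Lattice Fourier transform of the Bloch vectors, centred on cell n."""
    w = [0j] * G
    for k in range(G):
        phase = cmath.exp(-2j * math.pi * k * n / G) / math.sqrt(G)
        for m, b in enumerate(bloch(k)):
            w[m] += phase * b
    return w

def overlap(u, v):
    return sum(a.conjugate() * b for a, b in zip(u, v))

print([round(abs(c), 6) for c in wannier(2)])
```

In this idealized case each Wannier vector is concentrated entirely on its own cell, and vectors on different cells are orthogonal and related by a lattice translation. In a real crystal the localization is only approximate.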


38) W. Kohn and S. Michaelson, Proc. Phys. Soc. (London) 72, 301 (1958); W. Kohn, Phys. Rev. 115, 809 (1959).

39) E.C. McIrvine and A.W. Overhauser, Phys. Rev. 115, 1531 (1959).


In the Hartree-Fock scheme, the total wave function (2) and the density matrix (4) are invariant with respect to unitary transformations of the basic spin-orbitals $\psi_1, \psi_2, \ldots, \psi_N$. It was pointed out by Lennard-Jones 40)


40) J. Lennard-Jones, Proc. Roy. Soc. (London) A198, 1, 14 (1949), and a series of papers by Lennard-Jones, Hall, and Pople during the years 1950-52; for detailed references, see G.G. Hall, Proc. Roy. Soc. (London) 213, 113 (1952).


that, instead of molecular orbitals and Bloch functions, it may sometimes be convenient to introduce a localized set of orbitals which are all equivalent to the atoms of the system. This equivalent orbital method has now been applied by Hall 41) for investigating the electronic structure of certain crystals of diamond type. The problem of the solution of the Hartree-Fock equations (1) in terms of localized orbitals has recently been studied also by Adams 42).


41) G.G. Hall, Phil. Mag. (7) 43, 338 (1952), diamond; Phil. Mag. (8) 3, 429 (1958), Si, Ge, and diamond.

42) W.H. Adams, J. Chem. Phys. 34, 89 (1961).




Let us now return to the Bloch functions $\psi(\mathbf{k},\mathbf{r})$. As previously shown, these functions are associated with $G^3$ points in the space of the reduced wave vector $\mathbf{k}$, and they are orthogonal and non-interacting with respect to the effective Hamiltonian. Since the number of independent points is so enormously large, one has to treat only a selection of $\mathbf{k}$-values which are usually chosen to correspond to symmetry points in the reciprocal lattice 43). In each such point, one tries to find the Bloch function, the energy $\epsilon(\mathbf{k})$ and its first and second derivatives, and an essential problem is then the interpolation to intermediate $\mathbf{k}$-values. This problem has been attacked by a simplified LCAO-method 44) and by a method based on the use of a pseudo-potential 45); in all events, a great deal of care is necessary to get reliable results.


43) F.C. von der Lage and H.A. Bethe, Phys. Rev. 255 (1944); 71, 612 (1947).

44) J.C. Slater and G.F. Koster, Phys. Rev. 94, 1498 (1954); M. Miasek, Phys. Rev. 108, 92 (1957).

45) J.C. Phillips, Phys. Rev. 112, 685 (1958).


It follows from the condition (16) that each Bloch function may be written in the form

$$\psi(\mathbf{k},\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}\, u(\mathbf{k},\mathbf{r}) , \tag{39}$$

where $u$ is a function with the periodicity of the lattice, so that $u(\mathbf{k},\mathbf{r}+\mathbf{m}) = u(\mathbf{k},\mathbf{r})$ for every lattice translation $\mathbf{m}$. Instead of determining the Bloch function within the entire microcrystal, it is now sufficient to evaluate $u(\mathbf{k},\mathbf{r})$ within a unit cell or an equivalent region. It is convenient to introduce the "cellular polyhedron" consisting of all non-equivalent points in the ordinary lattice having the smallest value of $|\mathbf{r}|$; its boundaries are defined by the relations

$$|\mathbf{r}| = |\mathbf{r} - \mathbf{m}| , \tag{40}$$

analogous to (23), and the "cellular polyhedron" in the ordinary lattice corresponds




apparently  to  the  first  Brillouin  zone  in  the  reciprocal  lattice.  It  follows  from 
(40)  that  the  boundaries  are  the  planes  bisecting  perpendicularly  the  lines 
between  the  origin  and  the  nearest  neighbours  among  its  equivalent  points. 
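The geometric definition of the cellular polyhedron translates directly into a membership test: a point belongs to the cell if it is at least as close to the origin as to any other lattice point. The sketch below is a minimal illustration under that reading; the function name, the tolerance, and the restriction to the nearest shell of lattice points are assumptions made here.

```python
import itertools
import math

def in_cellular_polyhedron(r, basis, shells=1, tol=1e-12):
    """True if r is no farther from the origin than from any lattice point.

    basis  : three primitive translation vectors
    shells : range of integer multiples tested; 1 suffices for a simple
             cubic lattice, strongly skewed cells may need more (assumption)
    """
    origin = (0.0, 0.0, 0.0)
    for n in itertools.product(range(-shells, shells + 1), repeat=3):
        if n == (0, 0, 0):
            continue
        m = [sum(n[i] * basis[i][c] for i in range(3)) for c in range(3)]
        if math.dist(r, origin) > math.dist(r, m) + tol:
            return False
    return True

cubic = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
print(in_cellular_polyhedron((0.4, 0.0, 0.0), cubic))
print(in_cellular_polyhedron((0.6, 0.0, 0.0), cubic))
```

The same test applied to reciprocal lattice vectors would select the first Brillouin zone, which is the correspondence noted in the text.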

In the cellular method developed by Wigner and Seitz 46), one tries to determine the function $u(\mathbf{k},\mathbf{r})$ by numerical integration in analogy with Hartree's treatment of atoms. Wigner and Seitz assumed that it was possible to approximate $u$ by an s-function independent of $\mathbf{k}$, but later the importance of the higher spherical harmonics was emphasized 47)


46) E. Wigner and F. Seitz, Phys. Rev. 43, 804 (1933); 46, 509 (1934).

47) J.C. Slater, Phys. Rev. 45, 794 (1934); Revs. Modern Phys. 6, 209 (1934).


and $u$ should actually be expanded in the form:

$$u(\mathbf{k},\mathbf{r}) = \sum_{l,m} R_{lm}(\mathbf{k}; r)\, Y_{lm}(\vartheta,\varphi) , \tag{41}$$

where the radial functions should, in principle, be determined by numerical integration. The difficulty of the method is to get the periodicity condition $u(\mathbf{k},\mathbf{r}+\mathbf{m}) = u(\mathbf{k},\mathbf{r})$ satisfied on the boundary planes of the cellular polyhedron, or at least in a selected set of symmetry points 48), when the series


48) W. Shockley, Phys. Rev. 52, 866 (1937); F.C. von der Lage and H.A. Bethe, Phys. Rev. 71, 612 (1947); W. Kohn, Phys. Rev. 87, 472 (1952).


(41) is truncated. It should be observed that, if the resulting function $\psi(\mathbf{k},\mathbf{r}) = e^{i\mathbf{k}\cdot\mathbf{r}}\, u(\mathbf{k},\mathbf{r})$ is not a true Bloch function, it can always be resolved into Bloch components by using the projection technique and formula (20). The cellular method has been applied to the problem of band structure for a series of crystals of various types; for references, see Herman 24) and Pincherle 25).

The cellular method was actually devised for a study of the cohesive




properties of the alkali metals 49), but in this field it has to a certain extent been replaced by the semi-empirical quantum defect method introduced by Kuhn and Van Vleck 50) and developed by Brooks 51); for a survey, see Ham 52).


49) See e.g. the survey by E. Wigner, Proc. Int. Conf. Theor. Phys. Japan, 649 (Tokyo 1954).

50) T.S. Kuhn and J.H. Van Vleck, Phys. Rev. 79, 382 (1950); T.S. Kuhn, Phys. Rev. 79, 515 (1950); Quart. Appl. Math. 9, 1 (1951); Proc. Int. Conf. Theor. Phys. Japan, 640 (Tokyo 1954).

51) H. Brooks, Phys. Rev. 91, 1027 (1953).

52) F.S. Ham, Solid State Physics 1, 127 (Academic Press, New York 1955).


It is a characteristic feature of most of the present calculations within the one-electron scheme for crystals that the potential in the effective Hamiltonian is assumed to be a crystal potential of the periodicity of the lattice which is derived from semi-empirical arguments or theoretical considerations. In the Hartree-Fock scheme, the potential in (3) contains a conventionally periodic part and an exchange term of a non-local character. The evaluation of the effective Hamiltonian requires the knowledge of all functions $\psi(\mathbf{k},\mathbf{r})$ with $\mathbf{k}$-values within the Fermi surface, which means that a good solution to the interpolation problem is usually necessary. It is apparently very cumbersome to carry through a single Hartree-Fock cycle (5), not to speak of a series of iterations of this cycle, and it is hence extremely important that one is able to start from a good estimate of the crystal potential including exchange. Of course, one hopes that the band structure and other physical results should not be too dependent on the specific choice of potential, but the work by Howarth 53) on


53) D.J. Howarth, Proc. Roy. Soc. (London) A220, 513 (1953); Phys. Rev. 99, 469 (1955).


copper shows that this is not always the case. It seems hence important to try to reach the goal of self-consistency for a real crystal, but we note that, even if one obtains the exact Hartree-Fock functions, the corresponding Slater determinant (2) is still rather far from the true many-electron function.




The one-electron scheme has up till now been used to determine the spin-orbital energies $\epsilon(\mathbf{k})$ and the corresponding band structure for a large number of crystals. It has been of essential importance as the underlying theoretical tool for interpreting experiments 54), and it is of great

54) B. Lax, Revs. Modern Phys. 30, 122 (1958).


value  for  understanding  the  electric,  magnetic,  optical,  thermal,  and  elastic 
properties  of  solids.  At  the  same  time,  the  present  band  theory  is  certainly 
not  sufficient  to  explain  such  phenomena  as  refer  to  the  solid  as  a  whole  as, 
for  instance,  the  cohesive  properties,  the  relative  stability  of  various  lattice 
types,  the  criterion  for  ferromagnetism,  etc.  The  background  for  this  failure 
will  now  be  discussed. 


(d) Shortcomings of Band Theory; Correlation Error

The one-particle model is based on the idea that the particles move independently of each other. This happens, for instance, if the total Hamiltonian $H_{\mathrm{op}}$ is separable in the form $H_{\mathrm{op}} = \sum_i H_i$, and the total wave function is then a product of one-particle functions or spin-orbitals. In reality, the total Hamiltonian (7) has the form

$$H_{\mathrm{op}} = \sum_i H_i + \sum_{i<j} H_{ij} , \tag{42}$$

where $H_{ij}$ is a two-electron operator: $H_{ij} = e^2/r_{ij}$. Because of this Coulomb repulsion, two electrons always try to avoid each other to keep the energy as low as possible, and this leads to a certain "correlation" between their movements. Since the two electrons have actually to perform a more complicated motion than in the independent-particle model, there will be an increase in the kinetic energy which is compensated by a still larger decrease in the Coulomb energy; the balance is regulated by the virial theorem $\langle T \rangle = -\tfrac{1}{2}\langle V \rangle$. One can say that each electron is surrounded by a "Coulomb hole" with respect to all other electrons, and the omission of this phenomenon leads to the correlation error characteristic for the independent-particle model.




The correlation effect is most easily discussed by means of the second-order density matrix 55)


55) P.O. Löwdin, Phys. Rev. 97, 1474 (1955); R. McWeeny, Proc. Roy. Soc. (London) A232, 114 (1955); see also K. Husimi, Proc. Phys.-Math. Soc. Japan 22, 264 (1940).


where one should sum over the $N(N-1)/2$ possibilities of exchanging the coordinates $x_1$ and $x_2$, as well as $x_1'$ and $x_2'$, with the coordinates $x_i$ and $x_j$, respectively, in the total wave function $\Psi$. The diagonal element $\Gamma(x_1 x_2 \mid x_1 x_2)$ gives the probability density to find an electron pair in the points $x_1 = (\mathbf{r}_1, \zeta_1)$ and $x_2 = (\mathbf{r}_2, \zeta_2)$ in configuration space. The Coulomb energy of the electrons is given by the expression

(44)

and the existence of a "Coulomb hole" means that the quantity $\Gamma(x_1 x_2 \mid x_1 x_2)$ should be small when $r_{12} = |\mathbf{r}_1 - \mathbf{r}_2|$ tends to zero.
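For orientation, the two expressions discussed here can be written in a commonly used form. This is a sketch in the normalization usually associated with the density-matrix papers cited in footnote 55, under the assumption that the total wave function $\Psi$ is antisymmetric and normalized to unity; the displayed equations of the original may have been arranged differently.

```latex
\Gamma(x_1' x_2' \mid x_1 x_2)
  = \binom{N}{2} \int \Psi^*(x_1', x_2', x_3, \ldots, x_N)\,
      \Psi(x_1, x_2, x_3, \ldots, x_N)\; dx_3 \cdots dx_N ,
\qquad
\Big\langle \sum_{i<j} \frac{e^2}{r_{ij}} \Big\rangle
  = e^2 \int \frac{\Gamma(x_1 x_2 \mid x_1 x_2)}{r_{12}}\; dx_1\, dx_2 .
```

The binomial factor counts the $N(N-1)/2$ electron pairs, so that the diagonal element integrates to the number of pairs.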

A study of the second-order density matrix shows that, if the total wave function is approximated by a Hartree product, there will be no correlation whatsoever between the electrons 1 and 2. The situation is changed by the antisymmetrization and, if the total wave function is approximated by a single Slater determinant, the second-order density matrix will become antisymmetric in each set of its indices. This implies that $\Gamma(x_1 x_2 \mid x_1 x_2)$ will vanish of at least second order for $x_1 = x_2$, i.e. $r_{12} = 0$ and $\zeta_1 = \zeta_2$. This is the "Fermi hole" for electrons with parallel spins 56) and, since this


56) E. Wigner and F. Seitz, Phys. Rev. 43, 804 (1933); J.C. Slater, Phys. Rev. 81, 385 (1951); V.W. Maslen, Proc. Phys. Soc. (London) A69, 734 (1956).


hole to a certain extent replaces the Coulomb hole, the main part of the correlation error for electrons with parallel spins is removed. In the Hartree-Fock scheme, the essential correlation error is hence associated with electrons having antiparallel spins.

In order to get a measure of the order of magnitude of the correlation error in the Hartree-Fock scheme, it is convenient to introduce the concept of "correlation energy" 57), as the difference:

$$E_{\mathrm{corr}} = E_{\mathrm{exact}} - E_{\mathrm{HF}} , \tag{45}$$


57) E. Wigner, Phys. Rev. 46, 1002 (1934); Trans. Faraday Soc. 34, 678 (1938); F. Seitz, "Modern Theory of Solids" (McGraw-Hill, New York 1940) p. 698 f; J.C. Slater, Revs. Modern Phys. 25, 199 (1953); E.P. Wohlfarth, Revs. Modern Phys. 25, 211 (1953); D. Pines, "Solid State Physics" 1, 368 (Academic Press, New York 1955); P.O. Löwdin, Adv. Chem. Phys. 2, 207 (Interscience, New York 1959).


where $E_{\mathrm{exact}}$ is the exact eigenvalue of the Hamiltonian for the state under consideration and $E_{\mathrm{HF}}$ the corresponding Hartree-Fock energy. We note that the correlation energy is not a physical quantity but a measure of the error in a certain approximation. Two aspects of the correlation problem will be of particular importance:

a) the correlation error for the equilibrium state ($R = R_0$)

b) the correlation error for separated atoms ($R = \infty$)

where $R$ is a parameter indicating the internuclear distances.

Let us start the discussion by reviewing some data from atomic and molecular theory. For the series of helium-like ions (H$^-$, He, Li$^+$, ..., C$^{4+}$) in their (1s)$^2$ ground state, the correlation energy is remarkably constant 58),59) and varies between -1.1 and -1.2 eV, whereas for the ground


58) H. Shull and P.O. Löwdin, J. Chem. Phys. 24, 1035 (1956); 30, 617 (1959).

59) A. Fröman, Phys. Rev. 112, 870 (1958).


state of the Ne-like ions it lies around -11 eV. For atoms and ions without closed shells the correlation energy varies approximately linearly with


60) J. Linderberg and H. Shull, J. Mol. Spectroscopy 5, 1 (1960).


the atomic number $Z$ 60). For the hydrogen molecule, the correlation energy is -1.06 eV, and we note that, according to the virial theorem, this quantity consists of two parts, namely the correlation error in the kinetic energy and the corresponding error in the potential energy:

$$T_{\mathrm{corr}} = +1.06\ \mathrm{eV} , \qquad V_{\mathrm{corr}} = -2.12\ \mathrm{eV} . \tag{46}$$

Since 1 eV = 23.07 kcal/mole, these quantities are large from the chemical point of view.
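The arithmetic behind this partition is short enough to spell out. The sketch assumes that the virial theorem holds for both the exact and the Hartree-Fock energies at equilibrium, so that it also holds for their difference; the function name is hypothetical.

```python
EV_TO_KCAL_PER_MOLE = 23.07   # conversion factor quoted in the text

def virial_partition(e_corr):
    """Split a correlation energy using <T> = -E and <V> = 2E."""
    t_corr = -e_corr
    v_corr = 2.0 * e_corr
    return t_corr, v_corr

e_corr_h2 = -1.06   # eV, value for the hydrogen molecule quoted in the text
t_corr, v_corr = virial_partition(e_corr_h2)
e_corr_kcal = e_corr_h2 * EV_TO_KCAL_PER_MOLE

print(t_corr, v_corr, e_corr_kcal)
```

The kinetic part is positive and the potential part twice as large and negative, in line with the picture of a more complicated motion that lowers the Coulomb repulsion. In chemical units the total is roughly 24 kcal/mole.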

The problem of the error in the molecular-orbital theory for separated atoms was first investigated in a classical paper by Slater 61), where he studied the connection between the molecular-orbital approach and the valence-bond

61) J.C. Slater, Phys. Rev. 35, 509 (1930).


method by using the hydrogen molecule as an example. If $a$ and $b$ are the atomic orbitals, the total wave function in the MO-LCAO method takes, apart from normalization and the singlet spin factor, the form

$$\Psi \sim [a(1)+b(1)]\,[a(2)+b(2)] = [a(1)b(2)+b(1)a(2)] + [a(1)a(2)+b(1)b(2)] , \tag{47}$$

which implies that, for separated atoms, there is a fifty per cent chance that the molecule will dissociate into the ions H$^-$ and H$^+$, and an equal chance that it will dissociate into two H atoms. The energy of the former is considerably higher than the energy of the latter, and the resulting error is of the order of 8 eV.
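The fifty per cent figure and the size of the error can be reproduced with a few lines. The sketch assumes vanishing overlap between the atomic orbitals at infinite separation and frozen hydrogen 1s orbitals, for which the one-centre electron repulsion integral is 5/8 hartree; both are assumptions introduced here, and the names are hypothetical.

```python
from itertools import product

HARTREE_EV = 27.2114   # 1 hartree in eV

def expand_mo_product():
    """Count the terms of [a(1)+b(1)][a(2)+b(2)] by type.

    A term is 'ionic' when both electrons sit on the same atomic
    orbital and 'covalent' when they sit on different ones.
    """
    counts = {"ionic": 0, "covalent": 0}
    for orbital_1, orbital_2 in product("ab", repeat=2):
        kind = "ionic" if orbital_1 == orbital_2 else "covalent"
        counts[kind] += 1
    return counts

counts = expand_mo_product()
# With zero overlap the terms are orthogonal, so weights follow the counts.
ionic_fraction = counts["ionic"] / sum(counts.values())

# Ionic terms cost the one-centre repulsion (5/8 hartree for frozen 1s).
one_centre_repulsion = 5.0 / 8.0
error_ev = ionic_fraction * one_centre_repulsion * HARTREE_EV

print(ionic_fraction, round(error_ev, 2))
```

Under these assumptions the asymptotic error comes out near 8.5 eV, consistent with the order of magnitude stated in the text. Relaxing the orbitals would change the number somewhat.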

The  weakness  of  the  molecular-orbital  theory  and  of  the  band  theory 
of  solids  is  apparently  that  the  total  wave  function  is  such  that  it  does  not  prevent 
electrons  of  different  spins  to  accumulate  on  the  same  atom  and  give  rise  to 
negative  and  positive  ions  with  higher  energy  than  the  ordinary  dissociation 




62) J.H. Van Vleck and A. Sherman, Revs. Modern Phys. 7, 167 (1935).


products 62). In nature, the strong Coulomb repulsion between the electrons prevents the formation of negative ions with too many electrons, but apparently this correlation effect has been neglected in the Hartree-Fock scheme. The error is so large that one can speak of a complete breakdown of the independent-particle model and the molecular-orbital theory for separated atoms 63).


63) C.A. Coulson and I. Fischer, Phil. Mag. 40, 386 (1949).


Slater 64) has emphasized that the wrong asymptotic behaviour of the singlet energy curve for $R = \infty$ has a very serious consequence with respect to the study of magnetic properties. In a state where the electrons have parallel spins, the Pauli principle will prevent the formation of negative ions, and the energy will approach the correct value for $R = \infty$. The general shape of the energy curves is indicated in Fig. 1. Since the curve of lowest multiplicity has a wrong asymptotic behaviour for $R = \infty$, there will always be an artificial crossing point with the curve of highest multiplicity, which may lead to wrong conclusions about the general magnetic properties of the system. This may cause difficulties in a theory of ferromagnetism based essentially on band theory 65). Apparently the difficulty comes from the fact that the Hartree-Fock scheme treats electrons with parallel spins fairly well, whereas the study of electrons having antiparallel spins shows a large correlation error which has to be removed.


64) J.C. Slater, Phys. Rev. 82, 538 (1951); Revs. Modern Phys. 25, 199 (1953); Encyclopedia of Physics 19, 1 (Springer, Berlin 1956).

65) For a review, see e.g. E.C. Stoner, Repts. Prog. Phys. 11, 43 (1948); J. phys. radium 12, 372 (1951); E.P. Wohlfarth, Revs. Modern Phys. 25, 211 (1953).

66) D. Pines, Proc. 10th Solvay Conference, 9 (1954).


The  correlation  error  does  not  always  show  up  in  a  calculation,  which 
depends  on  the  fact  that  we  are  often  interested  in  energy  differences,  and  it 
may  happen  that  the  correlation  errors  associated  with  each  term  to  a  large 


Fig. 1. Energy curves for the states of lowest and highest multiplicities as functions of the internuclear distance R; numerical data refer to the H$_2$ molecule.


extent cancel. This happens, for instance, in studying the cohesive energy of an ionic crystal of the type of the alkali halides, since the electronic structure of the constituents and of the free ions are similar, and the correlation energy of the crystal is then approximately equal to the correlation energy of the free ions.

On the other hand, there is certainly no such cancellation in an investigation of the cohesive energy of the alkali metals. The correlation error for this case has been studied in great detail by Wigner 57), who derived the correlation energy formula

$$\epsilon_{\mathrm{corr}} = -\frac{0.288}{r_s + 5.1} ,$$

where all quantities are expressed in atomic units. For the alkali metals Li, Na, K, one obtains the following values for the correlation energy per doubly filled orbital, namely -1.89, -1.73, -1.58 eV, respectively.
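The quoted values for Li, Na, and K can be checked against Wigner's interpolation formula in its commonly quoted form, -0.288/(r_s + 5.1) hartree per electron with r_s in bohr. The sketch below assumes that form, and the r_s values are typical textbook numbers inserted here as assumptions; neither is stated explicitly in the text.

```python
HARTREE_EV = 27.2114   # 1 hartree in eV

def wigner_correlation_energy(r_s):
    """Wigner's interpolation formula, hartree per electron, r_s in bohr."""
    return -0.288 / (r_s + 5.1)

# Assumed electron-sphere radii in bohr (typical textbook values).
R_S = {"Li": 3.25, "Na": 3.93, "K": 4.86}

# A doubly filled orbital holds two electrons.
per_orbital_ev = {
    metal: 2.0 * wigner_correlation_energy(r_s) * HARTREE_EV
    for metal, r_s in R_S.items()
}

for metal, value in per_orbital_ev.items():
    print(metal, round(value, 2))
```

With these inputs the results fall within a few hundredths of an eV of the figures given in the text; small differences reflect the assumed r_s values.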

According to Wigner, the correlation energy should essentially be a function of the electron density. Of particular importance is Wigner's study of the low density limit which is based on the plasma model, in which the electrons in a crystal are approximated by an electron gas moving in a "uniform positive background". For sufficiently low density, the electrons will form a body-centered cubic lattice with interesting properties 67).


67) W.J. Carr Jr., Phys. Rev. 122, 1437 (1961).


The plasma model has later been strictly treated by Bohm and Pines 68) using field-theoretical methods. According to classical discharge theory, such


68) For a survey, see D. Pines, Phys. Rev. 92, 626 (1953) and reference 66.


a plasma shows a collective oscillatory behaviour with the fundamental frequency $\omega_p = (4\pi n_0 e^2/m)^{1/2}$, where $n_0$ is the average electron density. The field-theoretical study of the electronic correlation showed a long-range effect corresponding to the plasma oscillations and a short-range effect giving rise to an efficient electronic screening, which later has become of large importance in the so-called "dielectric approximation".
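To give a feeling for the magnitude of the plasma frequency, the formula can be evaluated in Gaussian units. The sketch below uses rounded physical constants and an assumed conduction-electron density for sodium; the density and the function name are illustrative assumptions, not data from the text.

```python
import math

E_ESU = 4.80320e-10       # elementary charge in esu
M_E_G = 9.10938e-28       # electron mass in g
HBAR_EV_S = 6.58212e-16   # reduced Planck constant in eV s

def plasma_frequency(n_cm3):
    """omega_p = (4 pi n e^2 / m)^(1/2) in rad/s, for n in cm^-3."""
    return math.sqrt(4.0 * math.pi * n_cm3 * E_ESU ** 2 / M_E_G)

n_na = 2.65e22            # assumed electron density of Na in cm^-3
omega_p = plasma_frequency(n_na)
hbar_omega_ev = HBAR_EV_S * omega_p

print(omega_p, hbar_omega_ev)
```

The quantum of the collective oscillation comes out at several eV, far above thermal energies, which is why these oscillations are not excited under ordinary conditions.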




Since in the simple plasma model there are no discrete nuclei, such aspects of the correlation problem as are concerned with the atomic constituents of a crystal will not be treated whatsoever. The problem of the asymptotic behaviour of the energy for separated atoms so strongly emphasized by Slater cannot be treated at all within the framework of this model. In the atomic approach, the correlation energy is certainly not a function of the electronic density only and, as an example, we would consider the series of helium-like ions which all have the same correlation energy, but which go from the extremely extended H$^-$ ion to the highly concentrated positive ions, like C$^{4+}$. Even if the simple plasma model has given very interesting and important results concerning the behaviour of the mobile electrons in metals, it has so far not given the ultimate answer to the problem of the correlation error in the band theory of ordinary crystals with discrete atomic nuclei. This question will be further discussed below.


3.  VALENCE  BOND  METHOD 
(a) Covalent Bond; Valence Bond Functions

Crystal physics can be approached from an entirely different point of view than band theory. In connection with e.g. cohesive properties, it seems natural to start from the chemists' ideas of bonding between atoms to describe the binding of the constituents of a crystal, and this leads to the valence bond method. According to Lewis, each covalent bond is associated with an electron pair which causes the binding, but the real nature of the bond was not revealed until the establishment of modern quantum mechanics. In connection with the problem of the helium atom, Heisenberg 69) had discovered the exchange phenomenon and the identity principle, which says that it is physically impossible to distinguish between the individual electrons. In modern terminology, it means that the permutation operator $P_{12}$ is a constant of motion, so that $P_{12}H = HP_{12}$. In investigating the hydrogen molecule, Heitler and London 70) found that the bonding of the atoms depended on this exchange effect and hence had an essentially quantum-mechanical character.

69) W. Heisenberg, Z. Physik 38, 411 (1926); 39, 499 (1926).
70) W. Heitler and F. London, Z. Physik 44, 466 (1927).




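Let $\Phi = \Phi(1,2)$ be a space function which describes the physical situation of an electron pair. By means of the identity

$$ 1 = \tfrac{1}{2}(1 + P_{12}) + \tfrac{1}{2}(1 - P_{12}), \tag{47} $$

where each term in the right-hand member is a projection operator, one can resolve this function into its symmetric and antisymmetric components with respect to $P_{12}$, which are orthogonal and non-interacting with respect to $H$. The symmetric space component is associated with the singlet state, and the antisymmetric space component with the triplet state and, for the corresponding energies, one obtains

$$ {}^{1}E = \frac{\langle\Phi|H(1 + P_{12})|\Phi\rangle}{\langle\Phi|1 + P_{12}|\Phi\rangle}, \tag{48} $$

$$ {}^{3}E = \frac{\langle\Phi|H(1 - P_{12})|\Phi\rangle}{\langle\Phi|1 - P_{12}|\Phi\rangle}, \tag{49} $$

which quantities should be compared with the expectation value $\langle\Phi|H|\Phi\rangle / \langle\Phi|\Phi\rangle$, which always lies between them. In this connection, it is convenient to introduce the exchange integral

$$ J = \tfrac{1}{2}\left({}^{1}E - {}^{3}E\right), \tag{50} $$

which may then be used as a criterion for the spin alignment. If $J > 0$, one has ${}^{1}E > {}^{3}E$ and parallel spins in the ground state, whereas, for $J < 0$, one has ${}^{1}E < {}^{3}E$ and antiparallel spins in the ground state. According to this simple model, the exchange integral would then give the criterion for ferromagnetism versus antiferromagnetism, if the concept could be generalized to crystals. Substitution of (48) and (49) into (50) gives the expression:

$$ J = \frac{\langle\Phi|\Phi\rangle\,\langle\Phi|HP_{12}|\Phi\rangle - \langle\Phi|H|\Phi\rangle\,\langle\Phi|P_{12}|\Phi\rangle}{\langle\Phi|\Phi\rangle^{2} - \langle\Phi|P_{12}|\Phi\rangle^{2}}. \tag{51} $$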
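The relations between the singlet energy, the triplet energy, the expectation value and the exchange integral are easily checked numerically. The sketch below is only an illustration under stated assumptions: the pair function is represented by a real vector in a small product space, the "Hamiltonian" is a random symmetric matrix made to commute with $P_{12}$, and the exchange integral is taken as half the singlet-triplet separation, $J = \tfrac{1}{2}({}^{1}E - {}^{3}E)$; the closed expression used for $J$ in the code follows from inserting the two energy quotients into that definition. The names `swap_operator` and `pair_energies` are hypothetical.

```python
import numpy as np


def swap_operator(n: int) -> np.ndarray:
    """Matrix of P12 on the product basis |i>|j>, stored at index i*n + j."""
    p = np.zeros((n * n, n * n))
    for i in range(n):
        for j in range(n):
            p[j * n + i, i * n + j] = 1.0
    return p


def pair_energies(h, p12, phi):
    """Return (E_singlet, E_triplet, E_mean, J) for a real trial vector phi."""
    norm = phi @ phi                 # <Phi|Phi>
    s = phi @ p12 @ phi              # <Phi|P12|Phi>
    h0 = phi @ h @ phi               # <Phi|H|Phi>
    hp = phi @ h @ p12 @ phi         # <Phi|H P12|Phi>
    e_singlet = (h0 + hp) / (norm + s)
    e_triplet = (h0 - hp) / (norm - s)
    j = (norm * hp - s * h0) / (norm**2 - s**2)
    return e_singlet, e_triplet, h0 / norm, j


rng = np.random.default_rng(0)
n = 3
p12 = swap_operator(n)
m = rng.normal(size=(n * n, n * n))
m = m + m.T                          # real symmetric model "Hamiltonian" ...
h = m + p12 @ m @ p12                # ... symmetrized so that H P12 = P12 H
phi = rng.normal(size=n * n)         # arbitrary two-electron space function

e1, e3, e_mean, j = pair_energies(h, p12, phi)
print(e1, e3, e_mean, j)
```

The expectation value is a weighted mean of the two energies with the positive weights $(\langle\Phi|\Phi\rangle \pm \langle\Phi|P_{12}|\Phi\rangle)/2\langle\Phi|\Phi\rangle$, which is why it must lie between them.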




Originally, the valence bond theory was based on the one-electron approximation, according to which one has $\Phi(1,2) = a(1)\,b(2)$, where $a$ and $b$ are two atomic orbitals (AO's) associated with the two constituents. The quantity $\langle a|b\rangle$ is known as the "overlap integral" and plays an important role in the theory. We note that one cannot start out from two orthogonalized AO's 71), $\bar{a}$ and $\bar{b}$, since the singlet would then not show any bonding; the exchange integral $J$ would further be positive, so that the triplet would be the ground state. The overlap problem is hence very essential.

71) J.C. Slater, J. Chem. Phys. 19, 220 (1951).


A careful analysis of the connection between the band theory or MO-method and the valence bond (VB) scheme was made by Slater, who used the H$_2$ molecule as a typical example. He showed that the VB-method including polar states, a(1) a(2) and b(1) b(2), would give the same result as the MO-method including configurational interaction between the bonding orbital (a + b) and the anti-bonding orbital (a - b). However, in their original and naive forms, the two approaches are certainly not equivalent. For the equilibrium state ($R = R_0$), they lead to rather similar results, whereas for separated atoms ($R = \infty$), the naive VB-method is superior to the naive MO-method, since the former gives a correct asymptotic behaviour of the singlet energy curve. In this respect, there is less correlation error in the naive valence bond method than in the ordinary band theory.

The total wave function for a valence bond singlet associated with an orbital pair (a,b) may be written in the form $\mathcal{A}\,a_1 b_2\,(\alpha_1\beta_2 - \beta_1\alpha_2)$, where $\mathcal{A}$ is the antisymmetrization operator. This construction is easily generalized 72,73) to a many-electron system having the orbital pairs (ab), (cd), (ef), ... etc., and the total valence-bond singlet is given by the expression

$$ \mathcal{A}\; a_1 b_2\, c_3 d_4\, e_5 f_6 \cdots (\alpha_1\beta_2 - \beta_1\alpha_2)(\alpha_3\beta_4 - \beta_3\alpha_4)(\alpha_5\beta_6 - \beta_5\alpha_6)\cdots, \tag{52} $$

where there is one spin singlet $(\alpha\beta - \beta\alpha)$ for each orbital pair. The collection of orbitals a, b, c, d, e, f, ... may, of course, be paired in many different ways, and each one gives rise to a valence bond singlet. The correct number of linearly independent valence bond singlets may be found by means of Rumer's non-crossing rule 73,74) for the valence bonds. There is a close parallelism between the quantum-mechanical wave function and the corresponding chemical formula for the compound, which has been further developed in the theory of chemical resonance 75).

72) W. Heitler and G. Rumer, Göttinger Nachr. 1930, 277.
73) G. Rumer, Göttinger Nachr. 1932, 337.
74) L. Pauling, J. Chem. Phys. 1, 280 (1933).
75) J.C. Slater, Phys. Rev. 37, 481 (1931), particularly p. 489; L. Pauling, J. Chem. Phys. 1, 280 (1933), and a series of papers in J. Chem. Phys. and J. Am. Chem. Soc.


In the case when the overlap integrals between the orbitals a, b, c, d, ... are neglected, the expectation value of the total energy and its matrix elements with respect to the valence bond singlets are fairly easily evaluated. However, this approach will not describe chemical bonding unless the overlap integrals are included, and it then turns out to be extremely cumbersome to calculate the elements of the energy matrix. The best way to solve this problem systematically seems to be to resolve the valence bond singlets into spin-projections of Slater determinants 77). The valence bond singlets are hence physically simple but, with respect to the energy, mathematically complicated.

77) See e.g. J.C. Slater, Quarterly Progress Report of Solid-State and Molecular Theory Group, M.I.T., p. 3, October 15, 1953 (unpublished).

P.O. Löwdin, Technical Note 2, Uppsala Quantum Chemistry Group (1957); Coll. Int. Centre Nat. Rech. Sci. 82, 23, Paris 1958.


If the overlap problem is difficult for a molecule, it becomes almost prohibitive for a crystal. It was pointed out by Slater that the inclusion of the overlap integrals in the application of the VB-method to crystals would lead to divergence difficulties of such a severe type that one has later called it a "non-orthogonality catastrophe" 78). Actually, each matrix element of the energy is of the form $\infty/\infty$ but, in the denominator and the numerator, there is a common infinite factor, and the remaining quotient is well-behaved. This problem is still not completely solved in all details, and we will comment more about it below.

78) D.R. Inglis, Phys. Rev. 46, 135 (1934).

Another problem in the VB-theory for treating crystals is that apparently the polar states are of fundamental importance 79), particularly in connection with conductivity phenomena. The basic theory shows many interesting aspects but is rather complicated in the applications. A simplification of this approach could be obtained if one could, in principle, include all polar states, since one could then use orthogonalized atomic orbitals or Wannier functions as a basis 80).

79) S. Schubin and S. Wonssowsky, Proc. Roy. Soc. 145, 159 (1934); Physik. Z. Sowjetunion 7, 292 (1935); 10, 348 (1936); S. Wonssowsky, Fortschritte der Physik 1, 239 (1954).
80) For a study of the molecular case, see R. McWeeny, Proc. Roy. Soc. (London) A223, 63, 306 (1954).


Starting from the chemists' point of view, Pauling 81) has developed a resonating-valence-bond theory of metals, which seems to be remarkably successful as a semi-empirical device. A valence-bond treatment based on the use of bond orbitals instead of atomic orbitals 82) should also be mentioned.

81) L. Pauling, Nature 161, 1019 (1948); Proc. Roy. Soc. A196, 343 (1949); Physica 15, 23 (1949).
82) C.A. Coulson, Proc. Int. Conf. Theor. Phys. Japan 629 (Tokyo 1953).


It has been pointed out above that the valence-bond method including polar states and the molecular-orbital method including configurational interaction lead to identical results, that the methods in their simple original form are rather different, and that the naive VB-method seems superior to the naive MO-method in treating correlation effects. In order to explain the peculiar behaviour of crystals like NiO, which are insulators but still have incompletely filled bands, Mott 83) raised the question whether the simple valence bond method is particularly well suited for certain classes of crystals (insulators) and the band theory for other classes (conductors). One could think that correlation effects would be more important in insulators than in conductors, but these effects are probably just as essential in all types of crystals. This problem will be further discussed in Sec. 5.

83) N.F. Mott, Proc. Phys. Soc. (London) A62, 416 (1949).


(b) Dirac-Van Vleck Vector Model

In the study of the magnetic properties of crystals, the valence-bond method has been used in a particular form known as the Dirac-Van Vleck vector model 84). In this approach, the spin-degeneracy problem of a many-electron system is investigated under the assumption that the space part is characterized by a set of orbitals a, b, c, d, ... and that one has integrated over the space coordinates. The splitting of the energy levels is then given by the eigenvalues of the spin Hamiltonian:

$$ \mathcal{H}_{\rm spin} = E_{\rm av} - \sum_{i<j} J_{ij}\left(\tfrac{1}{2} + 2\,\mathbf{s}_i\cdot\mathbf{s}_j\right), \tag{53} $$

which works in the spin-space only; here $E_{\rm av}$ is an average energy, and the coefficients $J_{ij}$ are the exchange integrals. This formalism has been successfully utilized in the spin-wave model 85) and in the theory of superexchange 86).

84) P.A.M. Dirac, Proc. Roy. Soc. (London) A123, 714 (1929); J.H. Van Vleck, "Theory of Electric and Magnetic Susceptibilities" (Oxford University Press, London 1932); Phys. Rev. 45, 405 (1934).

The original derivation was based on the assumption that the orbitals a, b, c, d, ... were all orthogonal, and the entire approach has been criticized by Slater 87) on this ground. The simple example of two electrons shows that, if the orbitals a and b are assumed to be orthogonal, one could neither discuss




85) H.A. Bethe, Z. Physik 71, 205 (1931); L. Hulthén, Arkiv f. mat., astr., fysik 26A, 11 (1938); P.W. Anderson, Phys. Rev. 86, 694 (1952); R. Kubo, Phys. Rev. 87, 568 (1952); F. Dyson, Phys. Rev. 102, 1217 (1956); J. van Kranendonk and J.H. Van Vleck, Revs. Modern Phys. 30, 1 (1958); F. Bopp and E. Werner, Z. Physik 151, 10 (1958); and others.
86) H.A. Kramers, Physica 1, 182 (1934); P.W. Anderson, Phys. Rev. 79, 350 (1950); for further references, see e.g. P.W. Anderson, Phys. Rev. 115, 2 (1959).
87) J.C. Slater, Revs. Modern Phys. 25, 199 (1953).


magnetic alignment nor bonding. The remedy is to use overlapping orbitals or to include polar states 88). The "non-orthogonality catastrophe" in connection with the overlap integrals in crystal theory has previously been mentioned 61,78), and a long series of papers has now been written on this subject 89).

88) R. Serber, J. Chem. Phys. 2, 697 (1934); Phys. Rev. 45, 461 (1934).
89) J.H. Van Vleck, Phys. Rev. 49, 232 (1936); P.O. Löwdin, J. Chem. Phys. 18, 365 (1950); W.J. Carr Jr., Phys. Rev. 92, 28 (1953); Y. Mizuno and T. Izuyama, Progr. Theoret. Phys. Japan 22, 344 (1959); F. Takano, J. Phys. Soc. Japan 14, 348 (1959); T. Arai (unpublished).


It should be observed that it may be quite possible to incorporate non-orthogonality, polar states, correlation effects, etc. in the vector model in a simple way 90). For a two-particle system, one has a singlet and a triplet state and the identity

$$ E = \tfrac{1}{2}\left({}^{1}E + {}^{3}E\right) - \tfrac{1}{2}K\left({}^{1}E - {}^{3}E\right) = \tfrac{1}{2}\left({}^{1}E + {}^{3}E\right) - KJ, \tag{54} $$

where ${}^{1}E$ and ${}^{3}E$ could be the true energies, $J$ is the exchange integral defined by (50) and (51), and $K = -1$ for the singlet state ($S = 0$) and $K = +1$ for the triplet state ($S = 1$). The quantity $K$ may be considered as a spin operator which has the same eigenvalues and eigenfunctions as the operator $(\tfrac{1}{2} + 2\,\mathbf{s}_1\cdot\mathbf{s}_2)$ and, according to (54), one obtains

$$ \mathcal{H}_{\rm spin} = \tfrac{1}{2}\left({}^{1}E + {}^{3}E\right) - J\left(\tfrac{1}{2} + 2\,\mathbf{s}_1\cdot\mathbf{s}_2\right), \tag{55} $$

which is the spin Hamiltonian desired. The question whether this approach could be generalized to more electrons is now being investigated. If this is the case, the vector model would certainly form a good basis for a semi-empirical theory fully in line with the applications carried out so far 85,86).

90) P.O. Löwdin, Technical Note 46, Uppsala Quantum Chemistry Group; Revs. Modern Phys. 34, 1 (1962).
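That the spin operator $\tfrac{1}{2} + 2\,\mathbf{s}_1\cdot\mathbf{s}_2$ takes the value $-1$ for the singlet and $+1$ for the triplet can be verified directly on the four-dimensional spin space of two electrons. The following sketch is only an illustration: the two energies are assumed numbers in arbitrary units, the exchange integral is taken as half the singlet-triplet separation, and the variable names are hypothetical.

```python
import numpy as np

# Pauli matrices; the one-electron spin operators are s = sigma / 2.
sigma_x = np.array([[0, 1], [1, 0]], dtype=complex)
sigma_y = np.array([[0, -1j], [1j, 0]], dtype=complex)
sigma_z = np.array([[1, 0], [0, -1]], dtype=complex)

# s1 . s2 on the four-dimensional two-spin space.
s1_dot_s2 = sum(np.kron(s, s) for s in (sigma_x, sigma_y, sigma_z)) / 4.0

# Spin operator K = 1/2 + 2 s1.s2.
k_op = 0.5 * np.eye(4) + 2.0 * s1_dot_s2
k_eigenvalues = np.linalg.eigvalsh(k_op)

# Assumed illustrative energies (arbitrary units), singlet below triplet.
e_singlet, e_triplet = -1.10, -0.80
j_exchange = 0.5 * (e_singlet - e_triplet)
h_spin = 0.5 * (e_singlet + e_triplet) * np.eye(4) - j_exchange * k_op
levels = np.linalg.eigvalsh(h_spin)
print(k_eigenvalues, levels)
```

The two-electron spin Hamiltonian built in this way has the singlet energy once and the triplet energy three times among its eigenvalues, whatever values are inserted for the two energies.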


(c)  Extension  of  Valence-Bond  Method 

In chemistry, the concept of the covalent bond is of such fundamental importance that it seems highly desirable to try to obtain a simple and useful formulation of the VB-method free of the previously mentioned mathematical difficulties connected with the overlap. As indicated in the discussion in connection with equations (48)-(51), the basic space function $\Phi(1,2)$ in the VB-method is essentially a two-electron function, and there is no necessity of using the orbital approximation. The corresponding valence bond singlet would then have the form $\mathcal{A}\,\Phi(1,2)\,(\alpha_1\beta_2 - \beta_1\alpha_2)$. For a many-electron system having the bonds (ab), (cd), (ef), ... with the space functions $\Phi_{ab}$, $\Phi_{cd}$, $\Phi_{ef}$, ..., one would instead of (52) get the more general valence bond singlet

$$ \mathcal{A}\;\Phi_{ab}(1,2)\,\Phi_{cd}(3,4)\,\Phi_{ef}(5,6)\cdots(\alpha_1\beta_2 - \beta_1\alpha_2)(\alpha_3\beta_4 - \beta_3\alpha_4)(\alpha_5\beta_6 - \beta_5\alpha_6)\cdots, \tag{56} $$

where, in each bond function, one could include the overlap, the polar states, and the full correlation effects in each bond.

Such a two-electron extension of the valence-bond method has been worked out by Hurley, Lennard-Jones, and Pople 91). The overlap associated with a specific bond does not cause any difficulties, but there is an overlap between the functions associated with different bonds which leads again to considerable mathematical complications. In order to simplify the theory, one has sometimes introduced the assumption of strong orthogonality between the bonds:

$$ \int \Phi_{ab}^{*}(1,2)\,\Phi_{cd}(1,3)\,dx_1 = 0, \tag{57} $$

which means that the bonds to a certain extent are independent of each other. The implications of this condition have recently been studied in detail 92).

91) A.C. Hurley, J. Lennard-Jones, and J. Pople, Proc. Roy. Soc. London A220, 446 (1953).
92) T. Arai, J. Chem. Phys. 33, 95 (1960); P.O. Löwdin, J. Chem. Phys. 35, 78 (1961).


The extended VB-method has been successfully applied to crystals: to a study of diamond by Schmid 93) and to an investigation of ZnS by Asano and Tomishima 94). In molecular theory, this approach has become known under the name of "perfect-pairing approximation" 95).

93) L.A. Schmid, Phys. Rev. 92, 1373 (1953); Am. J. Phys. 22, 255 (1954).
94) S. Asano and Y. Tomishima, J. Phys. Soc. Japan 11, 644 (1956).
95) See e.g. R.G. Parr, F.O. Ellison, and P.G. Lykos, J. Chem. Phys. 24, 1106 (1956); J.M. Parks and R.G. Parr, J. Chem. Phys. 28, 335 (1958); R. McWeeny and K.A. Ohno, Proc. Roy. Soc. (London) A225, 367 (1960); R. McWeeny, Revs. Modern Phys. 32, 335 (1960).




4. TIGHT-BINDING APPROXIMATION
(a) Basic Problems

The tight-binding approximation introduced in crystal theory by Bloch is a band theory using the atomic orbitals of the constituents as a basis, and it corresponds in its most refined form to the ASP-MO-LCAO-SCF method in molecular theory. The nature of the tight-binding scheme in general has been briefly discussed previously in this review and, in this section, we will concentrate our interest on some basic problems of particular importance connected with this approach. Since the valence-bond method is often based on atomic orbitals, some of the problems are common to both approaches.

Approximate linear dependencies. - The foundation of Ritz's method 9) for solving eigenvalue problems was discussed in Sec. 2c. If $\{f_k\}$ is a set of functions forming a complete basis, the Schrödinger equation is equivalent to a system of linear equations (36) with the secular determinant

$$ \det\{\langle f_k|H|f_l\rangle - E\,\langle f_k|f_l\rangle\} = 0. \tag{58} $$

We note that, if some of the functions in the set $\{f_k\}$ would be linearly dependent so that $\sum_k f_k a_k = 0$ for some non-vanishing coefficients $a_k$, the rows and columns in this determinant would also be linearly dependent, which implies that the secular determinant would be identically vanishing for all values of the parameter $E$. In order to be able to use the secular equation for determining the eigenvalues $E$, one has thus to be sure that the functions in the basis $\{f_k\}$ are linearly independent.

In this connection, it is convenient to introduce a certain measure $\mu$ for the degree of linear independence defined by the minimum of the quantity

$$ \Big\|\sum_k f_k a_k\Big\|^{2} = \sum_{kl} a_k^{*}\,\Delta_{kl}\,a_l, \qquad \Delta_{kl} = \langle f_k|f_l\rangle, \tag{59} $$

where the coefficients $a_k$ are subject to the auxiliary condition $\sum_k |a_k|^{2} = 1$, which means that they cannot all simultaneously be vanishing. For $\mu$ one has the alternative form

$$ \mu = \min_{\mathbf{a}}\;\frac{\mathbf{a}^{\dagger}\boldsymbol{\Delta}\,\mathbf{a}}{\mathbf{a}^{\dagger}\mathbf{a}}, \tag{60} $$

with the auxiliary condition removed, and we can hence draw the conclusion that $\mu$ is the smallest eigenvalue of the metric matrix $\boldsymbol{\Delta}$, which is positive definite whenever the set is linearly independent. If $\mu = 0$, the set $\{f_k\}$ is linearly dependent, whereas, if $\mu \neq 0$, the set is linearly independent and everything is in order, at least in the sense of ordinary mathematics.

However, in any numerical application of Ritz's method, one can use only a finite number of figures. This means that, if $\mu$ is smaller than the rounding-off error, the basic set is approximately linearly dependent, and the corresponding secular determinant (58) will be identically vanishing within the accuracy used. If the quantity $\mu$ is small but not necessarily vanishing, one has often a corresponding loss of significant figures in the calculation of $E$. The occurrence of approximate linear dependencies is hence a very serious problem from practical points of view.

This problem is not limited to the tight-binding approximation but is of a very general nature 96). An investigation of some of the standard radial sets $\{r^{n}\}$, $\{r^{n}e^{-r}\}$, etc. for n = 1, 2, 3, ... shows that the corresponding measures $\mu$ quickly become exceedingly small, and that the sets are actually to a high extent approximately linearly dependent.

96) P.O. Löwdin, Ann. Rev. Phys. Chem. 11, 107 (1960).

As another typical example, we will consider the set of powers $1, x, x^{2}, x^{3}, \ldots$ for $-1 \le x \le +1$, which is often used in studying e.g. angular behaviour with $x = \cos\theta$. From mathematics, we know that this set is complete and linearly independent, but an investigation of $\mu$ reveals that the set quickly becomes approximately linearly dependent. Since the even powers $1, x^{2}, x^{4}, \ldots$ are orthogonal to the odd powers $x, x^{3}, x^{5}, \ldots$, there are actually two orthogonal subsets which can be treated independently. The smallest eigenvalue $\mu$ of the metric matrix $\boldsymbol{\Delta}$ is given in Table I as a function of the number of functions in the subset, and the result is perhaps somewhat surprising. It tells us that one has to be extremely careful in using a non-orthogonal basis in applying Ritz's method in molecular and crystal theory. Since it seems as if the remedy would be a transformation of the basis to an orthonormal set, we will continue with a brief study of such procedures.

The phenomenon of the almost identically vanishing secular equation was first observed in crystal theory by Parmenter 97) in a tight-binding study of the lithium metal using Gaussian functions as atomic orbitals.

TABLE I. Lowest eigenvalue $\mu$ of the matrix $\Delta_{kl} = \langle x^{k}|x^{l}\rangle$ for the interval $-1 \le x \le +1$; n = number of members in each set. Units: $10^{-9}$.

  n    Even set, μ       n    Odd set, μ
  2    79 316 688        2    33 154 158
  3     3 275 556        3     1 254 936
  4       117 839        4        43 655
  5         4 002        5         1 451
  6           131        6            45
  7             5        7             1
  8             1        8             1

The author is indebted to F.K. Klaus Appel and F.K. Einar Lundqvist for carrying out the numerical calculations involved.

97) R.H. Parmenter, Phys. Rev. 86, 552 (1952).
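The rapid decrease of the measure $\mu$ for the powers of $x$ is easily reproduced on a modern computer. The following minimal sketch builds the metric matrix of a set of powers on the interval $-1 \le x \le +1$ and takes its smallest eigenvalue. The overall factor 1/2 in the scalar product is an assumption, chosen because it reproduces the leading even-set entries of Table I (0.0793 for n = 2 and 0.00328 for n = 3); the function names are hypothetical.

```python
import numpy as np


def power_metric(exponents, weight=0.5):
    """Metric matrix weight * integral_{-1}^{+1} x^(p+q) dx for the given exponents."""
    exps = np.asarray(exponents)
    total = exps[:, None] + exps[None, :]
    # The integral of x^s over [-1, 1] is 2/(s+1) for even s and 0 for odd s.
    return np.where(total % 2 == 0, weight * 2.0 / (total + 1.0), 0.0)


def independence_measure(exponents, weight=0.5):
    """Smallest eigenvalue of the metric matrix."""
    return np.linalg.eigvalsh(power_metric(exponents, weight))[0]


for n in range(2, 7):
    even = [2 * k for k in range(n)]       # 1, x^2, x^4, ...
    print(n, independence_measure(even))

mu_even_2 = independence_measure([0, 2])
mu_even_3 = independence_measure([0, 2, 4])
```

In ordinary double precision the smallest eigenvalue falls below the rounding level after only a moderate number of functions, so the last printed figures of such a calculation should not be trusted for larger sets.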


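Orthonormalization procedures. - Starting from the basis $\{f_k\}$ having a metric matrix $\boldsymbol{\Delta}$ with the elements $\Delta_{mn} = \langle f_m|f_n\rangle$, we will now study the general linear transformation $\mathbf{A}$ which transforms this basis to another basis $\{\varphi_k\}$ which is orthonormal, so that $\langle\varphi_m|\varphi_n\rangle = \delta_{mn}$. Using matrix notations, we will write the transformation in the form $\boldsymbol{\varphi} = \mathbf{f}\mathbf{A}$, or $\varphi_k = \sum_m f_m A_{mk}$. Since $\langle\boldsymbol{\varphi}|\boldsymbol{\varphi}\rangle = \mathbf{1}$ and $\langle\mathbf{f}|\mathbf{f}\rangle = \boldsymbol{\Delta}$, one obtains directly the condition $\mathbf{A}^{\dagger}\boldsymbol{\Delta}\mathbf{A} = \mathbf{1}$. Substituting $\mathbf{A} = \boldsymbol{\Delta}^{-1/2}\mathbf{B}$, one is led to the equation $\mathbf{B}^{\dagger}\mathbf{B} = \mathbf{1}$ and, since the transformation should be non-singular, $\mathbf{B}$ is a unitary matrix. The general orthonormalization procedure has hence the form 36)

$$ \boldsymbol{\varphi} = \mathbf{f}\,\boldsymbol{\Delta}^{-1/2}\,\mathbf{B}, \tag{61} $$

where $\mathbf{B}$ is an arbitrary unitary matrix. If $\mathbf{A}$ is chosen triangular, one obtains Schmidt's classical procedure of successive orthogonalization, which is more simply derived directly. If $\mathbf{B}$ is chosen equal to $\mathbf{1}$, one obtains the symmetric orthonormalization 33,34), in which all functions in the basis $\{f_k\}$ are treated in an equivalent way. In this case, it is essential to evaluate the matrix $\boldsymbol{\Delta}^{-1/2}$. Putting $\boldsymbol{\Delta} = \mathbf{1} + \mathbf{S}$, where $\mathbf{S}$ is the overlap matrix of the basis, one has the formal expansion

$$ \boldsymbol{\Delta}^{-1/2} = (\mathbf{1} + \mathbf{S})^{-1/2} = \mathbf{1} - \tfrac{1}{2}\mathbf{S} + \tfrac{3}{8}\mathbf{S}^{2} - \cdots, \tag{62} $$

which is convergent if the overlap is sufficiently small, for instance $\sum_l |S_{kl}| < 1$. For many crystals, the series (62) is divergent, and one has then to use more forceful methods to evaluate $\boldsymbol{\Delta}^{-1/2}$.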

The metric matrix $\boldsymbol{\Delta}$ is hermitean and positive definite, and we will let $\mathbf{U}$ be the unitary matrix which brings it to diagonal form $\mathbf{d}$, so that

$$ \mathbf{U}^{\dagger}\boldsymbol{\Delta}\,\mathbf{U} = \mathbf{d}, \tag{63} $$

where all the eigenvalues $d_k$ are positive and the smallest one gives the measure $\mu$ of linear independence. The matrix $\boldsymbol{\Delta}^{-1/2}$ may now be defined by the relation $\boldsymbol{\Delta}^{-1/2} = \mathbf{U}\,\mathbf{d}^{-1/2}\,\mathbf{U}^{\dagger}$, where one can choose e.g. the positive square roots in $\mathbf{d}^{-1/2}$. By means of this relation, one can prove some interesting theorems about the set $\boldsymbol{\varphi} = \mathbf{f}\,\boldsymbol{\Delta}^{-1/2}$ 98). It has further been shown 99) that, if the basis $\{f_k\}$ undergoes a unitary transformation, then the set $\{\varphi_k\}$ undergoes the same transformation.

98) G.W. Pratt Jr. and S.F. Neustadter, Phys. Rev. 101, 1248 (1956); B.C. Carlson and J.M. Keller, Phys. Rev. 105, 102 (1957); P.G. Lykos and H.N. Schmeising, J. Chem. Phys. 288 (1961).
99) J.C. Slater and G.F. Koster, Phys. Rev. 94, 1498 (1954).


It is clear that, unless the series (62) is rapidly convergent, the calculation of the matrix $\boldsymbol{\Delta}^{-1/2}$ is a cumbersome procedure, particularly for a crystal. Using the Chebyshev polynomials, one has recently obtained 100) a considerable simplification of this problem by deriving a closed expression for the elements of $\boldsymbol{\Delta}^{-1/2}$ for an infinite (periodic) chain and, by using perturbation technique, the same method can be extended to three dimensions.

100) P.O. Löwdin, R. Pauncz, and J. de Heer, J. Math. Phys. 1, 461 (1960).
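The symmetric orthonormalization may be illustrated by a small numerical sketch. It forms the inverse square root of the metric matrix from the eigenvalues and eigenvectors, checks that the transformed metric is the unit matrix, and compares the result with a partial sum of the binomial series for $(\mathbf{1} + \mathbf{S})^{-1/2}$. The model (a chain of six functions with nearest-neighbour overlap 0.2) and the function names are assumptions made only for the illustration; the Chebyshev technique mentioned above is not implemented here.

```python
import numpy as np


def inverse_sqrt(delta):
    """Delta^(-1/2) = U d^(-1/2) U^dagger from the eigen-decomposition of Delta."""
    d, u = np.linalg.eigh(delta)
    return (u / np.sqrt(d)) @ u.conj().T


def inverse_sqrt_series(s, terms=40):
    """Partial sum of the binomial series (1 + S)^(-1/2) = 1 - S/2 + 3 S^2/8 - ..."""
    result = np.eye(len(s))
    term = np.eye(len(s))
    for k in range(1, terms):
        term = term @ s * (-(2 * k - 1) / (2 * k))  # ratio of successive coefficients
        result = result + term
    return result


# Assumed model: a chain of 6 functions with nearest-neighbour overlap 0.2.
n, overlap = 6, 0.2
s = overlap * (np.eye(n, k=1) + np.eye(n, k=-1))
delta = np.eye(n) + s

a_sym = inverse_sqrt(delta)               # symmetric orthonormalization, B = 1
metric_new = a_sym.conj().T @ delta @ a_sym
series_error = np.abs(inverse_sqrt_series(s) - a_sym).max()
print(series_error)
```

For this small overlap the series converges quickly; if the overlap is increased so that the eigenvalues of the overlap matrix approach unity in magnitude, the partial sums converge slowly or not at all, whereas the eigenvalue route remains applicable as long as the metric matrix is positive definite.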


In discussing the symmetric orthonormalization, we have assumed that the basis $\{f_k\}$ is linearly independent, so that $\mu \neq 0$ and $\boldsymbol{\Delta}^{-1/2}$ exists. In order to treat also the case of exact and approximate linear dependencies, it is convenient to choose $\mathbf{B} = \mathbf{U}$ in (61), which leads to the canonical orthogonalization 101) $\boldsymbol{\varphi} = \mathbf{f}\,\mathbf{U}\,\mathbf{d}^{-1/2}$, or

$$ \varphi_k = d_k^{-1/2}\sum_m f_m U_{mk}, \tag{64} $$

which formula is valid for all $d_k \neq 0$. It may be convenient to arrange this set according to decreasing values of $d_k$; the sum of the absolute squares of the coefficients in (64) equals $d_k^{-1}$, and the set (64) has an optimum property in this connection.

101) P.O. Löwdin, Advances in Physics 5, 1 (1956), p. 49-56.

This means that, even if one goes over to an orthonormal set, the approximate linear dependencies will still show up in the calculations: the sum of the absolute squares of the coefficients in the last function will be $\mu^{-1}$, i.e. the coefficients will usually be very large at the same time as they have a small number of significant figures. However, formula (64) gives us at least a possibility of refining the calculations within a certain accuracy by striking away those functions $\varphi_k$ which correspond to too small eigenvalues $d_k$, but the completeness of the basis is then gone. The finite number of bits of our electronic computers (or desk machines, etc.) puts us hence in a dilemma, which has not yet been solved.
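The canonical procedure with removal of the nearly dependent combinations can be sketched as follows. The example basis (the even powers $1, x^{2}, \ldots, x^{14}$ with a factor 1/2 in the scalar product), the threshold value and the function name are assumptions chosen only for illustration.

```python
import numpy as np


def canonical_orthonormalization(delta, threshold=1e-8):
    """Return A = U d^(-1/2) restricted to eigenvalues d_k > threshold, largest first."""
    d, u = np.linalg.eigh(delta)          # ascending eigenvalues
    order = np.argsort(d)[::-1]           # arrange by decreasing d_k
    d, u = d[order], u[:, order]
    keep = d > threshold                  # strike out near-dependent combinations
    return u[:, keep] / np.sqrt(d[keep]), d


# Assumed example: the even powers 1, x^2, ..., x^14 on [-1, 1] (weight 1/2).
exps = 2 * np.arange(8)
delta = 1.0 / (exps[:, None] + exps[None, :] + 1.0)

a_can, d_all = canonical_orthonormalization(delta, threshold=1e-6)
metric_new = a_can.T @ delta @ a_can
coefficient_norms = (a_can**2).sum(axis=0)   # equals 1/d_k for each kept function
print(a_can.shape[1], "functions kept out of", len(exps))
```

The retained functions are orthonormal within the accuracy set by the threshold, but the space they span is smaller than that of the original basis, which is the dilemma described in the text.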


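In conclusion, it should be added that, in crystal theory, it is often highly convenient to use one more method, namely the successive orthonormalization of groups of functions. Let $\mathbf{a}$ and $\mathbf{b}$ represent two groups of functions having the metric matrix

$$ \begin{pmatrix} \mathbf{1} & \mathbf{S} \\ \mathbf{S}^{\dagger} & \mathbf{1} \end{pmatrix}, \tag{65} $$

where $\mathbf{S} = \langle\mathbf{a}|\mathbf{b}\rangle$ is a square or rectangular matrix. We will leave the first group $\mathbf{a}$ unchanged and replace the second group by a linear combination $\mathbf{b}' = (\mathbf{b} - \mathbf{a}\boldsymbol{\alpha})\boldsymbol{\beta}$. The orthogonality condition $\langle\mathbf{a}|\mathbf{b}'\rangle = \mathbf{0}$ gives $\boldsymbol{\alpha} = \mathbf{S}$, whereas the orthonormality condition $\langle\mathbf{b}'|\mathbf{b}'\rangle = \mathbf{1}$ leads to $\boldsymbol{\beta}^{\dagger}(\mathbf{1} - \mathbf{S}^{\dagger}\mathbf{S})\boldsymbol{\beta} = \mathbf{1}$ with the solution $\boldsymbol{\beta} = (\mathbf{1} - \mathbf{S}^{\dagger}\mathbf{S})^{-1/2}$. The result is hence

$$ \mathbf{b}' = (\mathbf{b} - \mathbf{a}\,\mathbf{S})(\mathbf{1} - \mathbf{S}^{\dagger}\mathbf{S})^{-1/2}, \tag{66} $$

which is a generalization of the standard Schmidt procedure to groups of functions. Formula (66) is useful, for instance, in deriving the orthogonalized plane waves or in handling groups of orthogonalized atomic orbitals.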
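The orthonormalization of one group of functions against another can be checked with a few lines of code. The sketch assumes that each group is orthonormal in itself and that the overlap block between the groups is $\mathbf{S} = \langle\mathbf{a}|\mathbf{b}\rangle$; the second group is replaced by $(\mathbf{b} - \mathbf{a}\mathbf{S})(\mathbf{1} - \mathbf{S}^{\dagger}\mathbf{S})^{-1/2}$. The group sizes, the random overlap block and the function names are hypothetical.

```python
import numpy as np


def inverse_sqrt(m):
    """Inverse square root of a positive definite hermitean matrix."""
    d, u = np.linalg.eigh(m)
    return (u / np.sqrt(d)) @ u.conj().T


def orthonormalize_group(s):
    """Coefficients (alpha, beta) of b' = (b - a alpha) beta for S = <a|b>."""
    alpha = s
    beta = inverse_sqrt(np.eye(s.shape[1]) - s.conj().T @ s)
    return alpha, beta


# Assumed model: groups a (2 functions) and b (3 functions), each orthonormal in itself.
rng = np.random.default_rng(1)
s = 0.1 * rng.normal(size=(2, 3))                  # rectangular overlap block <a|b>
metric = np.block([[np.eye(2), s], [s.T, np.eye(3)]])

alpha, beta = orthonormalize_group(s)
# Columns express (a, b') in terms of the original functions (a, b).
transform = np.block([[np.eye(2), -alpha @ beta], [np.zeros((3, 2)), beta]])
metric_new = transform.T @ metric @ transform
print(np.round(metric_new, 12))
```

The first group is left untouched, which is the feature that distinguishes this procedure from the symmetric orthonormalization of the combined set.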




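Orthonormalization problem in crystal theory. - The orthonormalization problem takes a very interesting form in crystal theory depending on the translational symmetry of the lattice. Let $\phi(\mathbf{r})$ be an arbitrary atomic orbital, i.e. a localized function centered around a certain lattice point which we may have chosen as the origin, and let $\boldsymbol{\phi}$ denote the set of all such orbitals $\phi(\mathbf{r} - \mathbf{m})$ centered around the equivalent points $\mathbf{m}$ in the lattice. This set has a metric matrix $\boldsymbol{\Delta}$ with the elements:

$$ \Delta(\mathbf{m},\mathbf{n}) = \int \phi^{*}(\mathbf{r} - \mathbf{m})\,\phi(\mathbf{r} - \mathbf{n})\,d\mathbf{r} = \Delta(\mathbf{0},\mathbf{n} - \mathbf{m}), \tag{67} $$

which is cyclic and which is hence brought to diagonal form by the unitary transformation

$$ U(\mathbf{m},\mathbf{k}) = G^{-3/2}\,e^{2\pi i\,\mathbf{k}\cdot\mathbf{m}}. \tag{68} $$

The eigenvalues of $\boldsymbol{\Delta}$ are then given by the formula

$$ d(\mathbf{k}) = \sum_{\mathbf{m}} e^{2\pi i\,\mathbf{k}\cdot\mathbf{m}}\,\Delta(\mathbf{0},\mathbf{m}). \tag{69} $$

Instead of the original set $\boldsymbol{\phi}$, we can now introduce a set $\boldsymbol{\varphi}$ of orthonormalized AO's by the symmetric procedure $\boldsymbol{\varphi} = \boldsymbol{\phi}\,\boldsymbol{\Delta}^{-1/2}$. Here the matrix $\boldsymbol{\Delta}^{-1/2}$ may be evaluated by various methods, of which at the present stage the Chebyshev expansion method 100) is probably the most forceful.

It is also of interest to consider the canonical orthonormalization procedure defined by (64). Using (68) and (17), we find that this approach leads directly to the standard Bloch functions associated with the set $\boldsymbol{\phi}$ in a properly normalized form.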
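The diagonalization of a cyclic metric matrix by the lattice Fourier transformation can be demonstrated on a one-dimensional ring. In the sketch below, the ring of $G = 8$ points, the Gaussian form of the overlap $\Delta(0,n)$, the phase convention $e^{2\pi i k m}$ with $k = 0, 1/G, \ldots, (G-1)/G$, and all variable names are assumptions introduced for the illustration.

```python
import numpy as np

G = 8                                          # number of lattice points on the ring
sites = np.arange(G)
distance = np.minimum(sites, G - sites)        # cyclic distance from the origin
delta_row = np.exp(-0.5 * distance**2)         # assumed overlap Delta(0, n)

# Cyclic metric matrix Delta(m, n) = Delta(0, n - m).
delta = np.array([[delta_row[(n - m) % G] for n in sites] for m in sites])

# Unitary matrix U(m, k) = G^(-1/2) exp(2 pi i k m), with k = 0, 1/G, ..., (G-1)/G.
k_values = sites / G
u = np.exp(2j * np.pi * np.outer(sites, k_values)) / np.sqrt(G)

# Eigenvalues from the lattice sum d(k) = sum_n exp(2 pi i k n) Delta(0, n).
d_k = np.array([np.sum(np.exp(2j * np.pi * k * sites) * delta_row) for k in k_values])

diagonalized = u.conj().T @ delta @ u
print(np.round(d_k.real, 6))
```

Since the eigenvalues are obtained from a single lattice sum, the measure of linear independence of the whole set of orbitals is simply the smallest value of $d(k)$ over the allowed wave vectors.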


The Bloch functions can actually be derived from the given atomic orbital $\phi(\mathbf{r})$ in several ways. According to (20), one can start from a single orbital $\phi(\mathbf{r})$ and resolve this function into its Bloch components

$$ \phi(\mathbf{r}) = \sum_{\mathbf{k}} O_{\mathbf{k}}\,\phi(\mathbf{r}), \tag{70} $$

$$ O_{\mathbf{k}}\,\phi(\mathbf{r}) = G^{-3}\sum_{\mathbf{m}} e^{2\pi i\,\mathbf{k}\cdot\mathbf{m}}\,\phi(\mathbf{r} - \mathbf{m}), \tag{71} $$

where $O_{\mathbf{k}}\phi$ is an unnormalized Bloch function of the standard type. Of course, one could also think of this Bloch function as being formed by linear combinations of the atomic orbitals in the various lattice points (LCAO). The different aspects may be valuable in different connections.

Bloch functions associated with different $\mathbf{k}$-values are orthogonal, whereas they are usually not normalized. The normalization integral for the function (71) takes the form

$$ \langle O_{\mathbf{k}}\phi|O_{\mathbf{k}}\phi\rangle = G^{-3}\sum_{\mathbf{m}} e^{2\pi i\,\mathbf{k}\cdot\mathbf{m}}\,\Delta(\mathbf{0},\mathbf{m}), \tag{72} $$

but the best way of normalizing the Bloch functions is probably to take the Bloch projections (multiplied by $G^{3/2}$) of the orthonormalized AO's, $\boldsymbol{\varphi} = \boldsymbol{\phi}\,\boldsymbol{\Delta}^{-1/2}$, where the matrix $\boldsymbol{\Delta}^{-1/2}$ is evaluated e.g. by Chebyshev technique. All the $G^{3}$ Bloch functions will then be normalized at once, whereas one otherwise has to carry out one normalization for each one of the $G^{3}$ $\mathbf{k}$-values. Valuable information may also be obtained by combining the two approaches.

It is remarkable that the LCAO Bloch functions formed from the
orthogonalized AO's $\varphi$ are, except for the normalization, completely
identical with those formed from the original AO's $\phi$. This is a special
case of a general invariance theorem, saying that the Bloch projection of any
linear combination

$\psi(\mathbf{r}) = \sum_{\mathbf{m}} \phi(\mathbf{r}-\mathbf{m})\,A(\mathbf{m})$ , (73)

with arbitrary coefficients $A(\mathbf{m})$ will, except for a normalization
factor, be identical with the corresponding Bloch projection of the function
$\phi(\mathbf{r})$.

100) P.O. Löwdin, J. Chem. Phys. 18, 365 (1950); Advances in Physics 5,
1 (1956), p. 53; R.G. Parr, J. Chem. Phys. 33, 1184 (1960).

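According to (19), one has

$T(\mathbf{m})\,O_{\mathbf{k}} = O_{\mathbf{k}}\,T(\mathbf{m}) = e^{2\pi i\,\mathbf{k}\cdot\mathbf{m}}\,O_{\mathbf{k}}$ , (74)

and applying $O_{\mathbf{k}}$ to $\psi$, we obtain

$O_{\mathbf{k}}\psi = \sum_{\mathbf{m}} A(\mathbf{m})\,O_{\mathbf{k}}\,\phi(\mathbf{r}-\mathbf{m}) = \Big\{ \sum_{\mathbf{m}} A(\mathbf{m})\,e^{-2\pi i\,\mathbf{k}\cdot\mathbf{m}} \Big\}\, O_{\mathbf{k}}\,\phi(\mathbf{r})$ , (75)

which proves the theorem. In this connection, the projection technique is hence
very convenient.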
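The invariance theorem lends itself to a quick numerical check. The sketch below is an illustration only: it replaces the crystal by a hypothetical one-dimensional ring with G = 4 lattice sites and P = 5 grid points per cell, assumes the conventions T(m) f(r) = f(r + m) and a projection operator of the form (1/G) Σ exp(2πi k m/G) T(-m), and uses an invented orbital `phi` and invented coefficients `A`.

```python
import cmath
import math

G, P = 4, 5          # lattice sites and grid points per cell (hypothetical sizes)
N = G * P            # total number of grid points on the ring

def translate(f, m):
    """T(m) f(r) = f(r + m), with m counted in lattice spacings on the ring."""
    return [f[(r + m * P) % N] for r in range(N)]

def bloch_project(f, k):
    """O_k f = (1/G) * sum_m exp(2*pi*i*k*m/G) * T(-m) f."""
    out = [0j] * N
    for m in range(G):
        phase = cmath.exp(2j * math.pi * k * m / G)
        shifted = translate(f, -m)
        out = [o + phase * s / G for o, s in zip(out, shifted)]
    return out

# An invented orbital localized around the origin of the ring.
phi = [math.exp(-min(r, N - r) / 2.0) for r in range(N)]

# Arbitrary (invented) coefficients A(m) of the linear combination psi.
A = [0.7, -0.2 + 0.5j, 0.1, 0.3j]
psi = [0j] * N
for m in range(G):
    shifted = translate(phi, -m)
    psi = [p + A[m] * s for p, s in zip(psi, shifted)]

def max_deviation(k):
    """Largest pointwise difference between O_k psi and factor * O_k phi."""
    factor = sum(A[m] * cmath.exp(-2j * math.pi * k * m / G) for m in range(G))
    lhs = bloch_project(psi, k)
    rhs = bloch_project(phi, k)
    return max(abs(l - factor * r) for l, r in zip(lhs, rhs))
```

For every k the deviation should vanish to rounding accuracy, which is the statement that the Bloch projection of ψ differs from that of the original orbital only by a constant factor.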


Completeness problem in tight-binding scheme. - It has been discussed in
various connections whether the atomic orbitals would form a sufficient basis
for band theory or whether something essential is missing in the tight-binding
method. It is evident that, if one introduces a complete set of AO's in every
lattice point, the basis will be highly overcomplete, and the key problem
will be to eliminate the redundancies connected with the linear dependencies.
If, on the other hand, one introduces a truncated set of AO's in each lattice
point, the treatment may be disturbed by approximate linear dependencies at
the same time as some essential element may be missing.

From a theoretical point of view, it is sufficient to introduce a complete
set of AO's $\{f_k\}$ in a single lattice point, since we may then use expansion
(34), i.e. $\psi = \sum_k f_k c_k$. In studying the Bloch functions, we can apply
the projection operator $O_{\mathbf{k}}$ and go over from (34) to (37), i.e.

$O_{\mathbf{k}}\psi = \sum_k (O_{\mathbf{k}} f_k)\, c_k$ , (76)

which relation says that it is possible to express every Bloch function associated
with the wave vector $\mathbf{k}$ in terms of the subset $\{O_{\mathbf{k}} f_k\}$.
From the completeness of $\{f_k\}$ follows hence the completeness of
$\{O_{\mathbf{k}} f_k\}$ with respect to the subspace characterized by
$\mathbf{k}$. Consequently, nothing can be missing.

However, if one uses a set of hydrogen-like orbitals 1s, 2s, 2p, 3s,
3p, 3d, ... and constructs the corresponding Bloch functions, one will find
a peculiarity in analyzing these functions in terms of plane waves: once
the orbitals for neighbouring atoms start having large overlap, the main
contribution to the Bloch function will come from the first Brillouin zone.
Except for the region around the nucleus, the Bloch functions will then become
more and more similar to a free wave associated with the first zone, and little
new will be obtained by adding more (nl)-functions. One should remember,
however, that the higher functions contribute to the description of the inner
parts of the atoms, and that a particularly important part comes from the
continuum, which is necessary to make the basis $\{f_k\}$ complete.

If one neglects the continuum in the tight-binding approximation, one
is certainly leaving out a very important part of the basis. It is true that the
handling of the continuum functions may cause some mathematical difficulties,
but these are easily circumvented if one follows Schrödinger's suggestion
and uses a set which is both entirely discrete and complete: such a set is easily
derived from the hydrogen-like orbitals by omitting the principal quantum
number n in the radial variable $\rho = 2Zr/n$. These new functions 1s, 2s, 2p,
3s, 3p, ... will be more localized within the atomic cell of interest, they will
give more details concerning the ion core and the nuclear region, at the same
time as the higher orbitals will give Bloch functions which are close to free
waves. The set of modified atomic orbitals has proven to be extremely useful in
atomic and molecular theory, and it will probably be just as valuable in crystal
theory.

E. Schrödinger, Ann. Physik 79, 361 (1926).

H. Shull and P.O. Löwdin, J. Chem. Phys. 23, 1362 (1955); 25, 1035
(1956); 30, 617 (1959); E. Holøien, Phys. Rev. 104, 1301 (1956);
Proc. Phys. Soc. A71, 357 (1958); J.O. Hirschfelder and P.O. Löwdin,
Molecular Physics 2, 229 (1959).


One could ask how an orthonormal set of Bloch functions should best
be constructed in the tight-binding scheme to give a basis which is in principle
complete and which does not contain any linear dependencies. If $\{f_k\}$ denotes
the set of modified atomic orbitals in a single lattice point, the projected
subsets $\{O_{\mathbf{k}} f_k\}$ associated with different reduced wave vectors
$\mathbf{k}$ are certainly mutually orthogonal and non-interacting with respect
to the Hamiltonian, but the individual functions within each subset
$\{O_{\mathbf{k}} f_k\}$ are neither normalized nor orthogonal. Since the
functions 1s, 2s, 2p, 3s, 3p, 3d, ... form a natural sequence, the functions
within each subset $\{O_{\mathbf{k}} f_k\}$ are conveniently transformed
by means of successive orthonormalization. If only a limited number of points in
$\mathbf{k}$-space will be studied, this is a procedure which is easily carried
out by considering one $\mathbf{k}$-value at a time.


However, if it is desirable to derive a complete set of Bloch functions
which are orthonormal within all the subsets associated with the reduced
wave vector $\mathbf{k}$, it is simpler to start by deriving a complete set of
atomic orbitals orthonormalized over all the lattice points. In such a case, one
starts by considering the functions 1s in all the lattice points and carries out
a symmetric orthogonalization according to (61), proceeds in the same way with
all the functions 2s, with all the functions 2p, etc., one type at a time. This
procedure seems physically feasible, since all the lattice points are treated
in an equivalent way. It leads to a sequence of groups of orthonormalized atomic
orbitals, which are then made mutually orthogonal by means of the successive
orthogonalization obtained by repeated use of formula (66). In each lattice
point, one gets in this way a set of orthonormal atomic orbitals 1s', 2s', 2p',
3s', 3p', ... which are translationally connected and altogether complete.
Finally, one forms the Bloch projections (77) of these orbitals, which
constitute the orthonormal, complete set desired. Each Bloch function is
here characterized by the reduced wave vector $\mathbf{k}$ and an index
corresponding to the atomic quantum numbers (nlm).

By using the invariance theorem (71), it may be shown that the two
ways of proceeding here described actually lead to identical results. For the
moment, it seems simpler to construct the complete set of translationally
connected atomic orbitals 1s', 2s', 2p', 3s', ..., since one can use the
Chebyshev technique for evaluating the (-1/2) power of a cyclic matrix in both
steps of the procedure, but, of course, it should be possible to find the
corresponding short-cut also in the other approach.

By constructing a complete orthonormal set of Bloch functions of the
type (77), one can hence remove two weak points in the tight-binding
approximation, namely the occurrence of approximate linear dependencies and the
incompleteness, particularly with respect to the inner region around each
lattice point, otherwise arising from the neglect of the continuum.


(b)  Recent  Applications 

For applications of the tight-binding approximation to crystal theory,
we will again refer to the previously mentioned reviews by Löwdin 23),
Herman, and Pincherle, and comment only on some recently published
papers.

The relation between the MO-LCAO method in molecular theory and
the tight-binding scheme in crystal theory can be particularly well studied in
connection with the graphite problem, where one can start out from a single
six-membered ring as in the benzene molecule, add more and more rings until
one obtains a graphite layer, and finally add the layers to a three-dimensional
crystal. The electronic structure of graphite, its diamagnetism and other
properties have successfully been studied in this way.

See e.g. C.A. Coulson and R. Taylor, Proc. Phys. Soc. (London)
A65, 815 (1952); D.F. Johnston, Proc. Roy. Soc. (London) A237,
48 (1956); M. Yamasaki, J. Chem. Phys. 26, 930 (1957);
J.W. McClure, Phys. Rev. 108, 612 (1957); R.R. Haering, Can. J.
Phys. 36, 352 (1958); S. Mase, J. Phys. Soc. Japan 13, 563 (1958);
J.C. Slonczewsky and P.R. Weiss, Phys. Rev. 109, 272 (1958);
T.E. Peacock and R. McWeeny, Proc. Phys. Soc. (London) 74, 385
(1959); H. Sato, J. Phys. Soc. Japan 14, 609 (1959); J. Koutecký and
M. Tomášek, Phys. Rev. 120, 1212 (1960).


In connection with diamond-type crystals, the work by Schmid
using the VB-method has previously been mentioned, and here we will only add a
study by Morita 106), where he uses a semi-localized crystal orbital method.

106) A. Morita, Progr. Theoret. Phys. 19, 534 (1958).


Among the papers on boron crystals, we would like to mention an
extensive investigation of the electronic structure and band properties of the
metal borides of type MB$_6$ carried out by Flodmark 107) and a study of boron
carbide by Yamasaki 108).

107) S. Flodmark, Arkiv f. Fysik 9, 1357 (1955); 11, 417 (1957); 14, 513
(1959); Svensk Kemisk Tidsk. 70, 12 (1958).

108) M. Yamasaki, J. Chem. Phys. 27, 746 (1957).


The oxide ionic crystals offer an interesting problem, and
Yamashita 109), 110) has now extended his previous work to a study of the oxygen
band in magnesium oxide, whereas O'Sullivan 111) has treated beryllium oxide.

109) J. Yamashita and M. Kojima, J. Phys. Soc. Japan 7, 261 (1952).

110) J. Yamashita, Phys. Rev. 111, 733 (1958).

111) W. O'Sullivan, J. Chem. Phys. 30, 379 (1959).


The tight-binding studies of the alkali hydrides and alkali halides are
being continued. The covalent character of lithium hydride has been investigated
by Morita and Takahashi 112) using semi-localized crystal orbitals, whereas the
behaviour of this crystal under very high pressure has been treated by
Behringer 113). The electronic structure of the alkali halides has been studied
by Grimley 114) with particular attention to lithium fluoride. Howland 115) has
finally carried through a careful study of the band structure and cohesive
properties of potassium chloride.

112) A. Morita and K. Takahashi, Progr. Theoret. Phys. 19, 257 (1958).

113) R.E. Behringer, Phys. Rev. 113, 787 (1959).

114) T.B. Grimley, Proc. Phys. Soc. (London) 70, 123 (1957); 71, 749
(1958).

115) L.P. Howland, Phys. Rev. 109, 1927 (1958).


The ionic crystals with constituents having completely filled shells are
remarkable from the point of view that the naive MO-method and the naive
VB-method lead to identical results with respect to all properties which may be
derived from the total wave function. These crystals are also particularly
convenient for a study by means of the tight-binding approximation, and a rather
fixed approach seems finally to have been established. In this connection, we
would like to make some critical comments on the conventional interpretation
of the data obtained in calculating e.g. the cohesive energy.


(c)  Virial  Theorem  in  Theory  of  Ionic  Crystals 

The classical theory of ionic crystals developed by Madelung and Born
was based on the fundamental assumption that the essential constituents of such
a crystal are the positively and negatively charged ions. The system of ions
was assumed to be in equilibrium under the influence of two types of potentials:
an attractive potential, corresponding to the electrostatic interaction between
the ions as point charges and represented by a Madelung energy, and a repulsive
potential, for which Born and Landé suggested the inverse power $C\,r^{-n}$ and
later Born and Mayer the exponential $C \exp(-r/\rho)$.

A characteristic feature of this model is that the Madelung energy
forms the dominating part of the cohesive energy of the crystal. In a recent
investigation it has been pointed out, however, that the cohesive energy
actually consists of several large terms of the same order of magnitude as the
Madelung contribution, and that the kinetic energy plays a very important role
in this connection.

A. Fröman and P.O. Löwdin, Technical Note 51, Uppsala Quantum
Chemistry Group (1960); J. Phys. Chem. Solids 20, ... (1961).


The ratio between the kinetic energy $\langle T \rangle$ and the potential
energy $\langle V \rangle$ is determined by the virial theorem 117) which, for a
system with only coulombic interactions, takes the special form
$\langle T \rangle = -\tfrac{1}{2}\langle V \rangle$, or $\langle T \rangle = -E$,
$\langle V \rangle = +2E$, where E is the total energy, $E = \langle T + V \rangle$.
For an ionic crystal, the virial theorem is satisfied in this simple form both
for the equilibrium state ($R = R_0$) and for the free ions ($R = \infty$), here
indicated by an index f (= free).

117) E.A. Hylleraas, Z. Physik 54, 347 (1929); V. Fock, Z. Physik 63,
855 (1930). For more complete references, see P.O. Löwdin, J. Mol.
Spectroscopy 3, 46 (1959).


The cohesive energy $E_{coh}$ is defined as the difference between the
total energy $E_0$ of the crystal in its ground state and the energy $E_f$ of the
free constituents, so that $E_{coh} = E_0 - E_f$. The change in kinetic energy
$\Delta T$ and the change in potential energy $\Delta V$ are further defined by
the relations:

$\Delta T = T_0 - T_f$ , $\Delta V = V_0 - V_f$ , (78)

and, using the virial theorem for both states, we hence obtain

$\Delta T = -E_{coh}$ , $\Delta V = 2 E_{coh}$ . (79)

These relations show that the kinetic energy increases under the formation of a
solid, whereas the potential energy decreases twice as much, leaving a balance
equal to the cohesive energy: $\Delta T + \Delta V = E_{coh}$. The kinetic energy
of a bound state is hence considerably larger than the kinetic energy of the free
constituents, which to a certain extent are excited or "promoted" 118) in a
compound.

118) K. Rüdenberg, Revs. Modern Phys. 34, .... (1962); in press.
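The step from (78) to (79) can be spelled out as follows. This is only a restatement of the argument above, using the virial relations T = -E and V = 2E for both the crystal (index 0) and the free constituents (index f).

```latex
\begin{align*}
\Delta T &= T_0 - T_f = (-E_0) - (-E_f) = -(E_0 - E_f) = -E_{\mathrm{coh}}, \\
\Delta V &= V_0 - V_f = 2E_0 - 2E_f = 2E_{\mathrm{coh}}, \\
\Delta T + \Delta V &= E_{\mathrm{coh}} .
\end{align*}
```

For LiF, with the cohesive energy of -244.4 kcal/mole listed in Table II, this gives an increase in kinetic energy of 244.4 kcal/mole and a change in potential energy of -488.8 kcal/mole.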


In Table II, we have gathered the values of the cohesive energy for
some of the alkali halides obtained empirically by means of the Born-Haber
cycle. We have further listed $\Delta T$ according to (79), whereas $\Delta V$
has been divided into two terms: the Madelung energy and the remaining potential
energy, which must necessarily depend on the extension of the ions. The
last term is negative and of the same order as the Madelung energy.

Because of the kinetic energy term, which here contains also a small
contribution from the nuclear motion, the interpretation is certainly strikingly
different from the conventional one. It may be shown that the quantum-mechanical
calculations of the cohesive energy of the alkali halides carried out so far on
the basis of the tight-binding approximation may, by means of an adjustable
scale factor, be brought into complete agreement with this picture. However, the
simple Born-Mayer model has certainly also to be modified to fulfil the
requirement of the virial theorem.


TABLE II. Interpretation of the cohesive energy of some alkali halides
according to Fröman and Löwdin, J. Phys. Chem. Solids ... (1961).

$\Delta T$ = Increase in kinetic energy in formation of solid
$\Delta V$ = Decrease in potential energy in formation of solid
             (sum of the Madelung energy and the remaining potential energy)

Units: kcal/mole

Crystal     E_coh      $\Delta T$    Madelung     Remaining
                                     energy       potential energy

LiF        -244.4      244.4        -291.0       -197.8
NaF        -216.3      216.3        -240.6       -192.0
KF         -192.1      192.1        -219.5       -164.7
RbF        -184.4      184.4        -208.0       -160.8
LiCl       -201.7      201.7        -228.1       -175.3
NaCl       -184.4      184.4        -208.0       -160.8
KCl        -167.9      167.9        -186.3       -149.5
RbCl       -162.1      162.1        -179.4       -144.8
LiBr       -191.3      191.3        -213.0       -169.6
NaBr       -175.9      175.9        -196.8       -155.0
KBr        -161.0      161.0        -178.6       -143.4
RbBr       -155.8      155.8        -171.5       -140.1
LiI        -179.3      179.3        -195.1       -163.5
NaI        -165.5      165.5        -181.4       -149.6
KI         -152.3      152.3        -166.3       -138.3
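The internal consistency of Table II can be checked directly from relation (79): the Madelung energy and the remaining potential energy should add up to twice the cohesive energy. The short sketch below does this for the tabulated values; the function and variable names are illustrative choices, not part of the original work.

```python
# (crystal, E_coh, Madelung energy, remaining potential energy), kcal/mole,
# copied from Table II.
TABLE_II = [
    ("LiF", -244.4, -291.0, -197.8),
    ("NaF", -216.3, -240.6, -192.0),
    ("KF", -192.1, -219.5, -164.7),
    ("RbF", -184.4, -208.0, -160.8),
    ("LiCl", -201.7, -228.1, -175.3),
    ("NaCl", -184.4, -208.0, -160.8),
    ("KCl", -167.9, -186.3, -149.5),
    ("RbCl", -162.1, -179.4, -144.8),
    ("LiBr", -191.3, -213.0, -169.6),
    ("NaBr", -175.9, -196.8, -155.0),
    ("KBr", -161.0, -178.6, -143.4),
    ("RbBr", -155.8, -171.5, -140.1),
    ("LiI", -179.3, -195.1, -163.5),
    ("NaI", -165.5, -181.4, -149.6),
    ("KI", -152.3, -166.3, -138.3),
]

def virial_changes(e_coh):
    """Return (delta_T, delta_V) from (79): delta_T = -E_coh, delta_V = 2 E_coh."""
    return -e_coh, 2.0 * e_coh

def remaining_potential(e_coh, e_madelung):
    """Part of delta_V = 2 E_coh that is not accounted for by the Madelung energy."""
    return 2.0 * e_coh - e_madelung

# Rows whose last column deviates from 2 E_coh - E_Madelung by more than rounding.
mismatches = [
    name
    for name, e_coh, e_mad, v_rem in TABLE_II
    if abs(remaining_potential(e_coh, e_mad) - v_rem) > 0.05
]
```

An empty `mismatches` list means that every row satisfies the virial balance to the accuracy of the table.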



5.  EXTENSION  OF  BAND  THEORY; 

DIFFERENT  ORBITALS  FOR  DIFFERENT  SPINS 

As  mentioned  earlier  in  this  review,  it  has  been  pointed  out  by  Slater, 
Pauling,  Mott,  and  others  that  the  naive  valence  bond  method  is  superior  to  the 
ordinary  band  theory  in  treating  correlation  effects  and  particularly  that  the 
former  leads  to  a  correct  asymptotic  behaviour  of  the  energy  curve  for 
separated atoms; compare Fig. 1. On the other hand, band theory has many
advantages  in  describing  conductivity  and  similar  properties,  and  the  question 
is  whether  it  is  possible  to  combine  the  advantages  of  the  two  approaches  by  a 
synthesis  of  the  two  ideas.  This  can  be  done  by  a  generalization  of  band  theory 
which  removes  part  of  the  correlation  error  discussed  in  Sec.  2d. 


Extended Hartree-Fock scheme. - The large correlation errors in the
conventional Hartree-Fock scheme depend undoubtedly on the fact that pairs of
electrons of opposite spins are forced together in doubly filled orbitals. This
electron pairing goes back partly to the classical formulation of Pauli's
exclusion principle, partly to the fact that this procedure permits a simple
construction of Slater determinants as pure eigenfunctions to the total spin,
$S^2$ and $S_z$. One can apparently remove a large part of the correlation error
by letting electrons with different spins occupy different orbitals in space, so
that they get a possibility to avoid each other; compare the discussion of the
"Coulomb hole" in Sec. 2d.

The idea of this orbital splitting comes originally from Hylleraas 119),
who used it in treating the helium atom, and it was intensely discussed for
two-electron systems at the Shelter Island Conference in 1951 120). There is an
obvious difficulty in generalizing the idea to a many-electron system, depending
on the fact that, if one permits different orbitals for different spins, the
corresponding Slater determinant will no longer be a pure spin state.

119) E.A. Hylleraas, Z. Physik 54, 347 (1929); C. Eckart, Phys. Rev. 36,
878 (1930).

120) M. Kotani, Proc. Shelter Island Conf., 139 (1951); G.R. Taylor and
R.G. Parr, Proc. Nat. Acad. Sci. U.S. 38, 154 (1952); J.E. Lennard-Jones,
Phil. Mag. 43, 581 (1952); R.S. Mulliken, Proc. Nat. Acad.
Sci. U.S. 38, 160 (1952).

By means of a simple projection operator technique, the Slater
determinant $D = (N!)^{-1/2} \det\{\psi_1, \psi_2, \psi_3, \ldots \psi_N\}$ may
uniquely be resolved into pure spin components ${}^{2S+1}D$, which are
orthogonal and non-interacting with respect to the total Hamiltonian (7), so that

$D = \sum_S {}^{2S+1}D$ , (80)

where one should sum over all values of S involved. The component of the
specific multiplicity (2S+1) is selected by means of a projection operator of
the form

${}^{2S+1}O = \prod_{k \neq S} \frac{\mathbf{S}^2 - k(k+1)}{S(S+1) - k(k+1)}$ , (81)

which annihilates all components except the one desired, which survives the
operation in an unchanged form. The operator O fulfills the relations
$O^2 = O$, $O^\dagger = O$, $\mathbf{S}^2 O = S(S+1)\,O$, and its properties have
been studied in detail.

P.O. Löwdin, Phys. Rev. 97, 1509 (1955); Coll. Int. Centre Nat.
Rech. Sci. 23 (Paris, 1958); Technical Note 12, Uppsala Quantum
Chemistry Group (1958).
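The action of a spin projection operator of the product form (81) can be made concrete for the smallest possible case. The sketch below treats only the spin functions of two electrons, where S can be 0 or 1, and resolves the mixed spin function α(1)β(2) into its singlet and triplet components. The basis ordering, the matrix representation of S² and the function names are assumptions made for this illustration.

```python
# Basis order for two spins 1/2: |aa>, |ab>, |ba>, |bb>  (a = alpha, b = beta).
# For two electrons S^2 = 1 + P12, with P12 the spin exchange operator.
S_SQUARED = [
    [2.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 1.0, 0.0],
    [0.0, 1.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 2.0],
]

def matmul(a, b):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def spin_projector(S, possible=(0, 1)):
    """Product over k != S of (S^2 - k(k+1)) / (S(S+1) - k(k+1))."""
    n = len(S_SQUARED)
    projector = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for k in possible:
        if k == S:
            continue
        denominator = S * (S + 1) - k * (k + 1)
        factor = [
            [(S_SQUARED[i][j] - (k * (k + 1) if i == j else 0.0)) / denominator
             for j in range(n)]
            for i in range(n)
        ]
        projector = matmul(projector, factor)
    return projector

def apply(matrix, vector):
    """Matrix acting on a column vector."""
    return [sum(row[j] * vector[j] for j in range(len(vector))) for row in matrix]

mixed = [0.0, 1.0, 0.0, 0.0]  # spin function alpha(1) beta(2)
singlet_part = apply(spin_projector(0), mixed)
triplet_part = apply(spin_projector(1), mixed)
```

The two components should come out as (αβ - βα)/2 and (αβ + βα)/2, and they add up to the original function, in line with the resolution (80).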


It is now possible to introduce an extension of the Hartree-Fock
scheme by considering a total wave function which is approximated by the
component of the Slater determinant D which has the pure spin desired, so
that

$\Psi = O\,D$ . (82)

If the basic spin-orbitals $\psi_1, \psi_2, \psi_3, \ldots \psi_N$ are subject to
a linear transformation, this wave function is changed only by a constant. This
implies that the Fock-Dirac density matrix $\rho$ defined by (4) will be the
fundamental invariant of the theory, which determines all physical properties.
Since the projection (81) will affect only the spin functions, it is clear that
the total wave function $\Psi$ will depend only on the two space density matrices
$\rho_+(\mathbf{r}_1, \mathbf{r}_2)$ and $\rho_-(\mathbf{r}_1, \mathbf{r}_2)$,
which are contained in $\rho$:

$\rho = \rho_+\,\alpha\alpha^{*} + \rho_-\,\beta\beta^{*}$ . (83)

For the expectation value of the Hamiltonian one obtains

$\langle H \rangle = \frac{\langle D \,|\, H\,O \,|\, D \rangle}{\langle D \,|\, O \,|\, D \rangle}$ , (84)

where one has used the turn-over rule and the relation $O^2 = O$. The variation
principle $\delta\langle H \rangle = 0$ leads to the best possible density
matrices $\rho_+$ and $\rho_-$, or to the corresponding best spin-orbitals. The
approach may be characterized as an extended Hartree-Fock scheme which preserves
the simple physical visuality of the one-electron model but still removes a very
large fraction of the total correlation error.

P.O. Löwdin, Nikko Symp. Mol. Phys., 13 (Maruzen, Tokyo 1954);
Phys. Rev. 97, 1509 (1955); Proc. 10th Solvay Conference, 71 (1955);
Revs. Modern Phys. 32, 328 (1960).


The general treatment of the extended Hartree-Fock scheme is
greatly simplified by the existence of a pairing theorem with respect to the
orbitals in $\rho_+$ and $\rho_-$. Let $u_1, u_2, \ldots u_m$ and
$v_1, v_2, \ldots v_n$ be the orbitals contained in $\rho_+$ and $\rho_-$,
respectively. Each set may be chosen orthonormal and, in addition, there exist
two unitary transformations $U$ and $V$, so that the two transformed sets
$u' = uU$ and $v' = vV$ fulfil the relation

$\int u_k'^{*}\, v_l' \, d\tau = \lambda_k\, \delta_{kl}$ . (85)

This implies that, without loss of generality, the orbitals may be chosen so
that each orbital in $\rho_+$ is orthogonal to all orbitals in $\rho_-$ except
possibly one, to which it is paired with an overlap integral fulfilling the
inequality $0 \le \lambda_k \le 1$. If $m > n$, the extra orbitals in $\rho_+$
may always be chosen orthogonal to all orbitals in $\rho_-$. The proof follows
simply by considering the quadratic or rectangular overlap matrix $S$, with the
elements $S_{kl} = \int u_k^{*} v_l \, d\tau$, of order $m \times n$ and
the unitary transformations $U$ and $V$ bringing the hermitean matrices
$S S^\dagger$ and $S^\dagger S$, respectively, to diagonal form. The pairing
theorem introduces far-reaching orthogonality simplifications in the calculations
and makes it possible to evaluate the energy in (84) in a straightforward way.

The solution of the ordinary Hartree-Fock equations for a molecular or
crystal system is a very complicated matter, and one can expect that the
treatment of the extended equations will be still more difficult. An ab initio
calculation of $\rho_+$ and $\rho_-$ would certainly give valuable information
about the mutual behaviour of electrons having antiparallel spins, but, for the
moment, one has to be satisfied with highly approximate solutions based on
suitable trial functions and a few adjustable parameters. In choosing the trial
functions, one is to a certain extent guided by the idea that "electrons with
different spins do try to avoid each other", but the justification of the entire
approach is the energy lowering finally obtained. In connection with the orbital
splitting, one speaks of "in-out effect", "right- and left-effect", "up- and
down-effect", "alternant effect", etc., but only the last idea will be briefly
discussed here.


Alternant Crystal Orbital Method. - In this section, we will consider an
extension of the ordinary band theory which is inspired by certain aspects of the
valence bond method. Again it is convenient to explain the idea by starting
from the hydrogen molecule. If a and b are the atomic orbitals involved, the
molecular orbital wave function and the valence bond wave function are actually
represented by the antisymmetric singlet components of the Hartree products
$(a_1 + b_1)(a_2 + b_2)\,\alpha_1\beta_2$ and $a_1 b_2\,\alpha_1\beta_2$,
respectively; see Fig. 2. In addition, we may now consider the antisymmetric
singlet component of the Hartree product $u_1 v_2\,\alpha_1\beta_2$ of the
semi-localized molecular orbitals 123) given by the expression

$u = a\cos\vartheta + b\sin\vartheta$ , $v = a\sin\vartheta + b\cos\vartheta$ .

When $\vartheta = 0$, one obtains the naive VB-method, whereas for
$\vartheta = 45°$ one gets the naive MO-method. The parameter $\vartheta$ gives
us hence a possibility of a continuous transition from one type of theory to the
other; it measures the degree to which the two electrons would like to avoid each
other, and may hence be denoted as the "correlation angle". A value of
$\vartheta$ intermediate between 0 and 45° corresponds to a valence-bond method
including polar states, to a molecular-orbital method including configuration
interaction, or to an extended MO-approach along the lines sketched above.

123) C.A. Coulson and I. Fischer, Phil. Mag. 40, 386 (1949).

Fig. 2. Comparison between the arrangements of orbitals and
spins in the valence bond method 1), the molecular-orbital
method 2), and the alternant molecular-orbital
method 3); H$_2$ molecule.
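The role of the correlation angle can be illustrated by the overlap within the pair of semi-localized orbitals. The sketch below assumes the Coulson-Fischer form u = a cos ϑ + b sin ϑ, v = a sin ϑ + b cos ϑ with real, normalized atomic orbitals a and b; the overlap value `s_ab` is an invented input, and the function name is chosen for illustration only.

```python
import math

def pair_overlap(theta_deg, s_ab):
    """Normalized overlap <u|v> / sqrt(<u|u><v|v>) for the semi-localized pair.

    With <a|a> = <b|b> = 1 and <a|b> = s_ab one finds
    <u|v> = sin(2*theta) + s_ab and <u|u> = <v|v> = 1 + s_ab * sin(2*theta).
    """
    t = math.radians(theta_deg)
    return (math.sin(2.0 * t) + s_ab) / (1.0 + s_ab * math.sin(2.0 * t))

# Overlap within the pair for a few correlation angles (s_ab = 0.6 is invented).
overlaps = {theta: pair_overlap(theta, 0.6) for theta in (0.0, 15.0, 30.0, 45.0)}
```

At ϑ = 0 the pair overlap reduces to the atomic overlap (valence bond limit), and at ϑ = 45° it becomes unity because both electrons then occupy the same molecular orbital; intermediate angles interpolate monotonically between the two.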

For a valence crystal, one could now think of an extended Hartree-Fock
scheme in terms of localized orbitals, where $\rho_+$ and $\rho_-$ are
such that each pair $(u_k, v_k)$ would be associated with a specific valence bond.
Because of the relation (85), there may then be a close connection between the
general pairing theorem in the extended Hartree-Fock scheme and the
orthogonality assumption (57) in the extended valence bond method or
"perfect-pairing" approximation discussed in Sec. 3c.

Compare references 41 and 42, with respect to the ordinary Hartree-Fock
method.

Let us now consider a simple crystal with a half-filled conduction band,
like the alkali metals. The ordinary band theory is here affected by a
considerable correlation error which is particularly accentuated in the wrong
behaviour of the singlet energy curve for separated atoms; see Fig. 1 (page 29)
and the discussion in Sec. 2d. In his classical 1930 paper, Slater has studied
this problem in connection with the body-centered cubic sodium metal, and he
pointed out that it seemed desirable to find a modification of the ordinary
MO-theory which, for separated atoms, would go over into some form of
VB-treatment based on the idea that the electrons with antiparallel spins would
separate, so that the electrons with plus spin would occur in the "corners" and
the electrons with minus spin in the "centers" of the lattice; see Fig. 3. The
advantage of such a spin arrangement would be that it would prevent the formation
of negative ions, which is the cause of the wrong asymptotic behaviour of the
energy curve. We will now try to realize and generalize this idea.

The body-centered cubic lattice is a special type of an important class
of crystals which is called alternant systems, and which is characterized by the
fact that all lattice points may be divided into two equivalent, interpenetrating
sublattices (I) and (II). The sublattice (II) is supposed to contain the origin
and will be called the even sublattice, whereas (I) will be called the odd
sublattice. In order to obtain an extension of the ordinary band theory, we will
now try to introduce alternant crystal orbitals which are semi-localized on the
two sublattices, and let electrons with plus spin tend to be associated with
sublattice (I) and those with minus spin associated with sublattice (II).

Fig. 3. Spin arrangement for separated atoms in body-centered
cubic lattice of sodium metal.

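For this purpose, we will consider the space of the reduced wave vector
$\mathbf{k}$ and all points which are situated within the Fermi surface of
ordinary band theory. Instead of the single Bloch projection operator
$O_{\mathbf{k}}$ defined by (17):

$O_{\mathbf{k}} = G^{-3} \sum_{\mathbf{m}} e^{2\pi i\,\mathbf{k}\cdot\mathbf{m}}\, T(-\mathbf{m})$ , (17)

it is now convenient to introduce the two partial sums over the two sublattices:

$O_{\mathrm{I}} = G^{-3} \sum_{\mathbf{m}}^{(\mathrm{I})} e^{2\pi i\,\mathbf{k}\cdot\mathbf{m}}\, T(-\mathbf{m})$ , $O_{\mathrm{II}} = G^{-3} \sum_{\mathbf{m}}^{(\mathrm{II})} e^{2\pi i\,\mathbf{k}\cdot\mathbf{m}}\, T(-\mathbf{m})$ , (87)

each one containing $G^3/2$ terms, and the splitting operators: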

Q  lei  ~  ^  )  ) 


(88) 


(89) 


These operators will work, for instance, on an atomic orbital
$\varphi(\mathbf{r})$ situated around the origin and will give rise to a set of
alternant crystal orbitals with one pair for each $\mathbf{k}$-value. For
$\vartheta = 45°$, there will be no splitting and the functions within each pair
will be identical and equal to ordinary Bloch functions. For $\vartheta = 0$,
there will be a complete splitting and delocalization of each pair on the
two sublattices involved, in accordance with Slater's idea.


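The operators (88) and (89) are all hermitean and satisfy some simple
algebraic relations which are useful in the applications. One has
$O_{\mathrm{I}}^2 = O_{\mathrm{II}}^2 = \tfrac{1}{2} O_{\mathrm{II}}$,
$O_{\mathrm{I}} O_{\mathrm{II}} = O_{\mathrm{II}} O_{\mathrm{I}} = \tfrac{1}{2} O_{\mathrm{I}}$,
where for simplicity we have omitted the index $\mathbf{k}$. This gives further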

£!iQl=  QiQx- 


(90) 




which relations are used in calculating the normalization integrals and the
overlap within the pair.

We note that the splitting operators $Q$ are not eigenfunctions to all
three primitive translations but that they always fulfil the relation:

[Equation (91) is illegible in the source.]

where the translation involved is a general translation from one point in a sublattice
to an equivalent point within the same sublattice. From this property follows also the
general orthogonality relation:

[Equation (92) is illegible in the source.]

which says that the splitting operators applied to a function $\varphi(\mathbf{r})$ will render
us a set of alternant crystal orbitals satisfying the pairing theorem (85). For
each point $\mathbf{k}$, there is hence an overlapping pair which is orthogonal towards
all other pairs. This property greatly simplifies the applications of the theory.

The basic Slater determinant $D$ is now constructed by assigning $\alpha$-spin
to orbitals of type I and $\beta$-spin to orbitals of type II for all points $\mathbf{k}$ within the
Fermi surface, so that the electrons are permitted to avoid each other. One
takes the projection (82) and evaluates the energy expectation value according to
(84), and the best value of the "correlation angle" $\vartheta$ and the best form of $\varphi(\mathbf{r})$
are then determined by means of the variation principle $\delta\langle H\rangle = 0$.
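As a toy illustration of varying a single correlation angle, the sketch below uses a two-site model with hopping parameter `t` and on-site repulsion `U`. This model, its parameter values, and all names in the code are assumptions introduced here, not a system treated in the text. With bonding and antibonding orbitals g and u, the alternant pair is a = cos θ g + sin θ u, b = cos θ g - sin θ u, and the singlet projection of the determinant built from a and b is proportional to cos²θ |gg> - sin²θ |uu>. The mixing angle θ used in the code corresponds to 45° - ϑ in the notation of this section.

```python
import numpy as np

def projected_energy(theta, t=1.0, U=4.0):
    """Energy of the singlet-projected alternant determinant (two-site toy model)."""
    # Hamiltonian in the basis {|gg>, |uu>} of doubly occupied bonding / antibonding orbitals
    H = np.array([[-2.0 * t + U / 2.0, U / 2.0],
                  [U / 2.0, 2.0 * t + U / 2.0]])
    # Coefficients of the projected wave function, up to normalization
    c = np.array([np.cos(theta) ** 2, -np.sin(theta) ** 2])
    return c @ H @ c / (c @ c)

t, U = 1.0, 4.0
thetas = np.linspace(0.0, np.pi / 4.0, 20001)        # scan of the mixing angle
energies = np.array([projected_energy(th, t, U) for th in thetas])

best = np.argmin(energies)
E_best = energies[best]                               # variational minimum
E_band = projected_energy(0.0, t, U)                  # no splitting: ordinary doubly filled orbital
E_exact = U / 2.0 - np.sqrt(4.0 * t ** 2 + U ** 2 / 4.0)   # lowest singlet of the 2x2 problem
vartheta_best_deg = 45.0 - np.degrees(thetas[best])   # angle in the convention of this section

print(E_band, E_best, E_exact, vartheta_best_deg)
```

In this small model the one-parameter variation already reproduces the lowest singlet energy of the two-configuration problem, which is a special feature of the toy system and should not be read as a general property of the method.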

It is evident that an important generalization of this approach is possible
by letting the correlation angle $\vartheta$ be a function of the reduced wave vector $\mathbf{k}$:

$$\vartheta = \vartheta(\mathbf{k}), \qquad (93)$$

where the form of the function could again be determined by the variation
principle.


In comparison to some earlier work, references 121 and 122, a change
of notation $\vartheta = 45° - \theta$ should be observed. Even $\theta$ was previously
characterized as "correlation angle".




It is remarkable, however, that a large improvement can be obtained
by using a single parameter $\vartheta$ and particularly that a correct asymptotic
behaviour of the singlet energy curve for separated atoms can be achieved by
observing that $\vartheta$ approaches 0°. For $\vartheta = 0°$, one gets purely alternant
orbitals which are completely delocalized on the two sublattices and, by a
proper choice of $\varphi(\mathbf{r})$, they can be made strictly orthogonal. In this case
the energy (84) takes the simple form:

[The equation is illegible in the source.]

where the latter term goes to zero for separated atoms and, since there is no
accumulation of negative ions, the energy curve gets the correct asymptotic
behaviour. Of still larger importance are probably the improvements which can
be obtained for the equilibrium state ($R = R_0$).

This approach has so far been essentially tested only for molecules,
where actually the difficulties connected with forming the projection (80) are
particularly accentuated. In an investigation of the benzene molecule, Itoh and
Yoshizumi 126) obtained $\vartheta \approx 22°$ and could show that about 85 % of the
previously known correlation energy could be removed, and this result has
recently been improved by de Heer 127) using two parameters $\vartheta$. The approach
has further proven to be valuable in a study of the alternating spin densities in
odd alternant hydrocarbon radicals 128); it has been used successfully for
investigating the correlation properties in the finite and infinite linear chain 129)
with the idea of making applications to conjugated systems. Studies of
three-dimensional crystals are now in progress.


126) T. Itoh and H. Yoshizumi, J. Phys. Soc. Japan 10, 201 (1955);
J. Chem. Phys. 23, 412 (1955); Busseiron Kenkyu 83, 13 (1955).

127) J. de Heer (private communication).

128) R. Lefebvre, H. H. Dearman, and H. M. McConnell, J. Chem. Phys.
32, 176 (1960).

129) R. Pauncz, J. de Heer, and P. O. Löwdin, Technical Notes 55 and
56, Uppsala Quantum Chemistry Group (1960); J. Chem. Phys. ...




Actually, it seems easier to use the alternant orbital method for
treating crystals and very large molecules rather than small molecules. The
reason is that the effect of the projection (2) becomes simpler for large $N$.
By using some of the previous results 130) one can easily show that, for a
finite value of $S$ and $\vartheta$ = [value illegible in the source], one obtains

$$\lim_{N\to\infty} \frac{\langle D | H\, {}^{2S+1}O | D \rangle}{\langle D | {}^{2S+1}O | D \rangle} = \frac{\langle D | H | D \rangle}{\langle D | D \rangle}, \qquad (95)$$

i.e. the energy of the spin component is the same as the energy of the
determinant $D$ itself. It is clear that, for a very large $N$, a single spin flip
or a finite number of flips cannot influence the total energy, so that the singlet,
triplet, quintet, ... etc. all have the same energy in this case. The determinant
$D$ contains also higher spin states with $S/N$ finite, but it follows from (94)
that they occur in such a small portion that they do not contribute to the average
energy of the mixture for $N = \infty$. A detailed study of the spin components in
$D$ is now being carried out in Uppsala.

130) See particularly equations (15)-(24) in P. O. Löwdin, Phys. Rev. 97,
1509 (1955).

Formula (95) indicates that, for large $N$, the variation with respect
to the starting function $\varphi(\mathbf{r})$ and the correlation parameter (93) may be
carried out as if the total wave function would simply be the Slater determinant
$D$. However, the singlet wave function is, of course, still represented by the
singlet projection of $D$, which ensures that the wave function is invariant under
the transformation [symbol illegible in the source] and that the spin density is
identically zero everywhere in space.

It should be mentioned that there are some similarities between this
approach and the unrestricted Hartree-Fock scheme developed by Slater and
his collaborators 131). It was pointed out by Slater that, in a system with
unbalanced spins having $S_z \neq 0$, the electrons with plus spin and those with
negative spin would be influenced by different exchange potentials. One could
hence expect that electrons with different spins would have different orbitals,
and this effect was called exchange polarization. In order to study this effect,
Slater approximated the total wave function by a single determinant with
different orbitals for different spins. Many important results have been obtained
so far by this approach, particularly with respect to magnetic behaviour 131).
For a detailed comparison between the unrestricted and the extended
Hartree-Fock schemes, we will refer to a recent paper 132).

131) J. C. Slater, Phys. Rev. 81, 385 (1951); 82, 538 (1951); Revs. Modern
Phys. 25, 199 (1953); R. K. Nesbet, Proc. Roy. Soc. A230, 312 (1955);
G. W. Pratt Jr., Phys. Rev. 102, 1303 (1956); J. H. Wood and
G. W. Pratt Jr., Phys. Rev. 107, 995 (1957); R. K. Nesbet and
R. E. Watson, Ann. Phys. 9, 260 (1960); L. M. Sachs, Phys. Rev. 117,
1504 (1960); R. E. Watson and A. J. Freeman, Phys. Rev. 120, 1125
(1960); Phys. Rev. 120, 1134 (1960).

132) P. O. Löwdin, Ann. Acad. Reg. Sci. Upsaliensis 2, 127 (1958).


The main result of this section is that one can obtain an essential
lowering of the total energy of a Slater determinant $D$ by permitting "different
orbitals for different spins". For $S_z = 0$, there will be a considerable orbital
splitting due to correlation and, for $S_z \neq 0$, there may be an additional exchange
polarization. The basic equations are the same as in the original Hartree-Fock
scheme characterized by (1)-(5), but no symmetry restrictions are imposed on
the spin-orbitals involved. Instead the symmetry properties are handled by a
component analysis of the determinant 133).

133) If this component analysis is omitted, one may obtain results which
look paradoxical. Compare the giant spin waves in A. W. Overhauser,
Phys. Rev. Letters 4, 415, 462 (1960), and the criticism by W. Kohn
and S. J. Nettel, Phys. Rev. Letters 5, 8 (1960); K. Sawada and
N. Fukuda, Progr. Theoret. Phys. 25, 653 (1961); T. Arai, Argonne
Report 1961 (unpublished).


In this way, it seems possible to obtain an extension of band theory
which preserves the physical simplicity of the conventional method but has an
essential part of the correlation error removed. For a schematic survey of the
advantages and disadvantages of the ordinary band theory, the valence bond
method, and the combined approach outlined here in the form of a table, we
will refer to another paper.




6. GENERAL SELF-CONSISTENT-FIELD THEORY AND
EXACT SOLUTION TO MANY-ELECTRON PROBLEM

For a long time, the Hartree-Fock scheme was considered as the
essential and ultimate theoretical tool for understanding the
independent-particle-model from the point of view of many-particle theory. The scheme was
successfully applied to the electronic clouds of the atoms and their shell structure,
to the mobile $\pi$-electrons of the conjugated compounds in organic chemistry,
and to the band structure of crystals. One believed that the qualitative and to
a certain extent also quantitative success of the scheme depended on the fact
that the interactions between the electrons were comparatively weak, and that
the correlation effects could be considered as a small perturbation.

The picture was completely changed with the discovery that the
independent-particle-model seemed to work extremely well also for the atomic
nuclei in the so-called nuclear shell-model. Here the explanation could hardly
be that the forces were weak, and it seemed necessary to find an extension
of the independent-particle-model which would work also for strong interactions
between the particles. Such an extension has been developed by Brueckner 134)
and his collaborators. The new scheme is based on the use of a scattering or
reaction operator, where the correlation between any two particles is exactly
included, whereas the correlation between three and more particles is neglected.
This so-called Brueckner approximation works very well for nuclear matter,
since the forces are of such a short-range nature.

134) K. A. Brueckner, C. A. Levinson, and H. M. Mahmoud, Phys. Rev.
95, 217 (1954); K. A. Brueckner, Phys. Rev. 96, 508 (1954);
97, 1353 (1955); 100, 36 (1955); K. A. Brueckner and C. A. Levinson,
Phys. Rev. 97, 1344 (1955); H. A. Bethe, Phys. Rev. 103, 1353 (1956);
J. Goldstone, Proc. Roy. Soc. (London) A239, 267 (1957); H. A. Bethe
and J. Goldstone, Proc. Roy. Soc. (London) A238, 551 (1957);
L. S. Rodberg, Ann. Phys. 2, 199 (1957); to mention only a selection
of the rich literature on this subject.

For an electronic system, the situation is a little bit different, since
the Coulomb forces are of such a long-range nature that it may be necessary
to include also the correlation between three and more electrons. This is
ultimately a question of order of magnitude and depends also on the accuracy
desired. Here we will briefly show that it is possible to extend the line of
development which goes from Hartree-Fock to Brueckner still further and
relate the exact formal solution of the many-electron Schrödinger equation to the
independent-particle-model through a self-consistent-field scheme containing
"average" potentials 135).

135) P. O. Löwdin, Technical Notes 47 and 48, Uppsala Quantum Chemistry
Group (1960).


Partitioning Technique for Solving Schrödinger Equation. - One of the strongest
tools for solving the Schrödinger equation $H\Psi = E\Psi$ in one-electron or
many-electron theory is rendered by the partitioning technique, since it contains many
of the conventional methods as special cases 136). The technique is also
convenient to explain the projection operator formalism that we are actually going
to use to solve the many-electron problem.

136) For references, see P. O. Löwdin, Technical Note 11, Uppsala
Quantum Chemistry Group (1958) /unpublished/.


In applying Ritz's expansion method discussed in Sec. 2c, we will
introduce a complete orthonormal basis $\{f_k\}$ and write the eigenfunction in
the form $\Psi = \sum_k f_k c_k$, where the coefficients $\{c_k\}$ form a column vector
$\mathbf{c}$. The system (36) may then be written in the condensed matrix form

$$\mathbf{H}\mathbf{c} = E\,\mathbf{c}, \qquad (96)$$

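which is simply the transform of the original Schrödinger equation in the
discrete representation introduced. Let us now divide or "partition" the
complete basis $\{f_k\}$ into two subsets (a) and (b), so that the set (a) contains a
finite number of functions. The matrix $\mathbf{H}$ and the vector $\mathbf{c}$ may then be
written in the form

$$\mathbf{H} = \begin{pmatrix} \mathbf{H}_{aa} & \mathbf{H}_{ab} \\ \mathbf{H}_{ba} & \mathbf{H}_{bb} \end{pmatrix}, \quad \mathbf{c} = \begin{pmatrix} \mathbf{c}_a \\ \mathbf{c}_b \end{pmatrix}, \qquad (97)$$

and equation (96) may be written as two equations:

$$\mathbf{H}_{aa}\mathbf{c}_a + \mathbf{H}_{ab}\mathbf{c}_b = E\,\mathbf{c}_a, \quad \mathbf{H}_{ba}\mathbf{c}_a + \mathbf{H}_{bb}\mathbf{c}_b = E\,\mathbf{c}_b. \qquad (98)$$

Solving $\mathbf{c}_b$ from the last equation, one obtains

$$\mathbf{c}_b = (E\cdot\mathbf{1}_{bb} - \mathbf{H}_{bb})^{-1}\mathbf{H}_{ba}\mathbf{c}_a, \qquad (99)$$

and substitution of this expression into the first equation gives

$$\bar{\mathbf{H}}_{aa}\mathbf{c}_a = E\,\mathbf{c}_a, \qquad (100)$$

$$\bar{\mathbf{H}}_{aa} = \mathbf{H}_{aa} + \mathbf{H}_{ab}(E\cdot\mathbf{1}_{bb} - \mathbf{H}_{bb})^{-1}\mathbf{H}_{ba}. \qquad (101)$$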


Equation (100) has exactly the same form as the original equation (96), but the
total matrix $\mathbf{H}$ is now condensed into a finite matrix $\bar{\mathbf{H}}_{aa}$ defined by
(101). This technique enables us to concentrate our interest on a certain subset
(a), whereas the influence of the other subset (b) may be considered as a
"perturbation" represented by the second term in (101). The partitioning technique
may be used in many different theoretical connections, and it is also an excellent
tool for the numerical solution of secular equations of very high orders 137); it
is then often convenient to choose the subset (a) as consisting of a single element,
and the method will still render both discrete and degenerate eigenvalues
without any difficulty.
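The partitioning relations (99)-(101) can be checked numerically on a small matrix. The sketch below is only an illustration under stated assumptions: the random symmetric test matrix, the subset size `n_a`, and the helper name `effective_matrix` are all introduced here and do not come from the text.

```python
import numpy as np

def effective_matrix(H, n_a, E):
    """Condensed matrix H_aa + H_ab (E*1_bb - H_bb)^(-1) H_ba for the first n_a basis functions."""
    H_aa, H_ab = H[:n_a, :n_a], H[:n_a, n_a:]
    H_ba, H_bb = H[n_a:, :n_a], H[n_a:, n_a:]
    n_b = H_bb.shape[0]
    return H_aa + H_ab @ np.linalg.solve(E * np.eye(n_b) - H_bb, H_ba)

rng = np.random.default_rng(0)
A = rng.normal(size=(8, 8))
H = (A + A.T) / 2.0                      # real symmetric stand-in for the Hamiltonian matrix

eigenvalues, eigenvectors = np.linalg.eigh(H)
E0, c = eigenvalues[0], eigenvectors[:, 0]   # exact lowest eigenvalue and eigenvector

n_a = 2
c_a, c_b = c[:n_a], c[n_a:]

# The condensed problem reproduces the exact eigenvalue when E is inserted in the inverse
H_eff = effective_matrix(H, n_a, E0)
residual = np.linalg.norm(H_eff @ c_a - E0 * c_a)

# The (b) part of the eigenvector is rebuilt from its (a) part
n_b = H.shape[0] - n_a
c_b_rebuilt = np.linalg.solve(E0 * np.eye(n_b) - H[n_a:, n_a:], H[n_a:, :n_a] @ c_a)

print(residual, np.max(np.abs(c_b_rebuilt - c_b)))
```

Since the eigenvalue itself appears inside the inverse, a practical solver has to iterate on $E$; here the exact value is inserted only to verify the identities.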


137) P. O. Löwdin, Adv. Chem. Phys. 2, 207 (Interscience, New York
1959), p. 270 f.




Projection Operator Formalism. - In this section, we will rewrite the
partitioning technique in a slightly more abstract form. Let $O$ be the projection
operator which selects the subspace (a) of order $g$ so that

$$O^2 = O, \quad O^{\dagger} = O, \quad \mathrm{Tr}(O) = g. \qquad (102)$$

The operator $P = 1 - O$ satisfies the relations $P^2 = P$, $P^{\dagger} = P$ and
$OP = PO = 0$, and it is apparently the projection operator for the subspace (b), which
we will characterize as the "orthogonal complement" to the subspace (a).

Let us start by considering a non-degenerate level $E$ and choose
$g = 1$. Let further $\Phi$ be an arbitrary trial function with a non-vanishing
projection $O\Phi = \varphi$, which we will normalize so that $\langle\varphi|\varphi\rangle = 1$, i.e.
$\langle\Phi|O|\Phi\rangle = 1$. For the eigenfunction $\Psi$, satisfying $(H - E)\Psi = 0$, one has the identity

$$\begin{aligned} \Psi &= (O + P)\Psi = \varphi + P K^{-1} K \Psi = \\ &= \varphi + P K^{-1}\,[\,K + P(H - E)(O + P)\,]\,\Psi = \\ &= \varphi + P K^{-1} P H \varphi + P K^{-1}\,[\,K - P(E - H)P\,]\,\Psi. \end{aligned} \qquad (103)$$


Here $K$ is an arbitrary non-singular operator which will now be chosen so that
we get rid of the last term in (103). We will introduce the definitions

$$K = P(E - H)P, \quad T = P K^{-1} P. \qquad (104)$$

In matrix notation, we would say that $K$ represents the (bb)-"corner" of the
matrix $(E\cdot\mathbf{1} - \mathbf{H})$, and that $T$ is the "inverse of the corner"; see
Fig. 4. In the following, we will often, instead of the full definition
$T = P\,[P(E - H)P]^{-1}\,P$, use the symbolic notation

$$T = \frac{P}{E - H}, \qquad (105)$$

but we have to remember its full meaning. It is clear that $T$ satisfies the
relations

$$OT = TO = 0, \quad P(E - H)T = P, \qquad (106)$$




which we will often use in the following. From (103), we obtain

$$\Psi = \varphi + T H \varphi = (O + THO)\,\Phi, \qquad (107)$$

which relation is analogous to (99). Of special interest is now the operator

$$\Omega = O + THO, \qquad (108)$$

since this operator applied to any trial function $\Phi$ will give an exact solution
$\Psi = \Omega\Phi$, provided that $O\Phi \neq 0$. This result indicates that $\Omega$ is an
eigenoperator to $H$, i.e. that

$$H\Omega = E\,\Omega, \qquad (109)$$

and it is further easily seen that $\Omega^2 = \Omega$. It should be observed that $\Omega$, which
consists of an idempotent term $O$ and a nil-potent term $THO$, does not commute
with its adjoint operator $\Omega^{\dagger}$ and it is hence not a normal operator. It may
be characterized as a non-normal projection operator, and its importance comes
from its connection with $\infty$-order perturbation theory.

From (109) follows further $O(H - E)\Omega = OHO + OHTHO - OEO = 0$,
and the energy relation:

$$OEO = O(H + HTH)O. \qquad (110)$$

Multiplying to the left and right by $\Phi$ and integrating, we obtain

$$E = \langle\varphi|H|\varphi\rangle + \langle\varphi|HTH|\varphi\rangle, \qquad (111)$$

which relation corresponds to the well-known Schrödinger-Brillouin formula 138)
in perturbation theory; the latter may be derived from (111) by expanding the
inverse $T$ by means of a power-series expansion. The corresponding wave
function is given by (107) and fulfills the normalization $\langle\varphi|\Psi\rangle = 1$. Because
of this connection, the projection operator formalism based on $\Omega$ is
equivalent to $\infty$-order perturbation theory.

138) L. Brillouin, J. Phys. radium (7) 3, 373 (1932); E. Wigner, Math.
naturw. Anz. ungar. Akad. Wiss. 53, 477 (1935).
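The properties claimed for the operator of (108) can be verified on a finite matrix. The following is a minimal sketch under assumptions made here: a random real symmetric matrix stands in for $H$, the subspace (a) is spanned by the first basis vector ($g = 1$), and the exact lowest eigenvalue is inserted for $E$. The variable names are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 6
A = rng.normal(size=(n, n))
H = (A + A.T) / 2.0                        # real symmetric stand-in for the Hamiltonian
E = np.linalg.eigvalsh(H)[0]               # exact (lowest) eigenvalue

O = np.zeros((n, n))
O[0, 0] = 1.0                              # projector on the reference function (g = 1)

# T = P [P(E - H)P]^(-1) P: invert only the (bb) corner, zero elsewhere
T = np.zeros((n, n))
T[1:, 1:] = np.linalg.inv(E * np.eye(n - 1) - H[1:, 1:])

Omega = O + T @ H @ O                      # the operator of (108)
phi = np.zeros(n)
phi[0] = 1.0

err_idempotent = np.linalg.norm(Omega @ Omega - Omega)       # Omega^2 = Omega
err_eigen = np.linalg.norm(H @ Omega - E * Omega)            # H Omega = E Omega, cf. (109)
err_energy = abs(phi @ (H + H @ T @ H) @ phi - E)            # energy relation, cf. (111)
non_normal = np.linalg.norm(Omega @ Omega.T - Omega.T @ Omega)

print(err_idempotent, err_eigen, err_energy, non_normal)
```

The last quantity is clearly different from zero, which reflects the statement that the operator does not commute with its adjoint.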

In (111) the eigenvalue problem is given in an implicit form
$E = f(E)$, where

$$f(E) \equiv \langle\varphi|\, H + H \frac{P}{E - H} H \,|\varphi\rangle, \qquad (112)$$

$$f'(E) = -\langle\varphi|\, H \frac{P}{(E - H)^2} H \,|\varphi\rangle. \qquad (113)$$

It is natural to try to solve this problem by a first-order iteration procedure
based on the formula $E^{(k+1)} = f(E^{(k)})$, and which leads to a series of values
$E^{(0)}, E^{(1)}, E^{(2)}, \ldots$ Putting $E^{(k)} = E + \varepsilon^{(k)}$ and using the mean-value
theorem $f(E + \varepsilon) = f(E) + \varepsilon f'(E + \theta\varepsilon)$ with $0 < \theta < 1$, one
obtains

$$\varepsilon^{(k+1)} = \varepsilon^{(k)} f'(E + \theta\varepsilon^{(k)}). \qquad (114)$$

Since $f'$ is always negative, the errors $\varepsilon^{(k)}$ will alternate in sign, which
implies that the successive values $E^{(k)}$ will alternately be upper and lower
bounds to $E$. Hence we have the bracketing theorem that between two
consecutive values in the series $E^{(0)}, E^{(1)}, E^{(2)}, \ldots$ there will always be at
least one eigenvalue. The procedure will be convergent if $|f'| < 1$ and
divergent if $|f'| > 1$.


A much faster convergence can be obtained by going over to a
second-order iteration procedure, e.g. by solving the equation $F(E) \equiv E - f(E) = 0$
by the Newton-Raphson process:

$$E^{*} = E^{(0)} - \frac{F(E^{(0)})}{F'(E^{(0)})} = E^{(0)} - \frac{E^{(0)} - f(E^{(0)})}{1 - f'(E^{(0)})}. \qquad (115)$$

It should be observed that the right-hand member is identical with the standard
variational expression in quantum mechanics. It is easily shown that this
process is always convergent.
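The two iteration schemes can be compared on a small example. The sketch below assumes a diagonally dominant real symmetric test matrix with the reference function taken as the first basis vector; the matrix, the coupling strength 0.1, and the function names `f` and `f_prime` are choices made here for illustration. The helper `f` evaluates the right-hand side of the implicit energy relation and `f_prime` its derivative.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 6
A = rng.normal(size=(n, n))
H = np.diag(np.arange(n, dtype=float)) + 0.1 * (A + A.T) / 2.0   # weakly coupled levels

h = H[1:, 0]                 # coupling between reference function and complement
H_bb = H[1:, 1:]             # (bb) corner of the matrix
I_b = np.eye(n - 1)

def f(E):
    """Right-hand side of the implicit relation E = f(E)."""
    return H[0, 0] + h @ np.linalg.solve(E * I_b - H_bb, h)

def f_prime(E):
    """Derivative of f; it is never positive."""
    x = np.linalg.solve(E * I_b - H_bb, h)
    return -(x @ x)

E_exact = np.linalg.eigvalsh(H)[0]

# First-order iteration: successive errors alternate in sign
E_first = [H[0, 0]]
for _ in range(3):
    E_first.append(f(E_first[-1]))
errors = [value - E_exact for value in E_first]

# Second-order (Newton-Raphson) iteration on F(E) = E - f(E)
E_newton = H[0, 0]
for _ in range(20):
    E_newton = E_newton - (E_newton - f(E_newton)) / (1.0 - f_prime(E_newton))

# A single Newton step equals the Rayleigh quotient of the corresponding trial vector
E0 = H[0, 0]
psi = np.concatenate(([1.0], np.linalg.solve(E0 * I_b - H_bb, h)))
rayleigh = psi @ H @ psi / (psi @ psi)
newton_step = E0 - (E0 - f(E0)) / (1.0 - f_prime(E0))

print(errors, E_newton - E_exact, rayleigh - newton_step)
```

The starting value is taken as the diagonal element of the reference function, which is a convenient but not unique choice.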




Connection with Schrödinger Perturbation Theory. - Let us now consider the
case, when $H = H_0 + V$ where $V$ is an arbitrary weak or strong
perturbation. We will assume that $O$ is now the eigenoperator to $H_0$ associated
with the level $E_0$ under consideration, so that $H_0 O = O H_0 = E_0 O$. In other
words, $O$ will project out the unperturbed eigenfunction $\varphi_0$. We note that
we need here only one single eigenfunction to $H_0$ and not the complete spectrum,
which is an essential simplification; the orthogonal complement to $\varphi_0$
characterized by $P$ may be obtained by orthogonalizing any complete set towards $\varphi_0$.
From (106), (108), and (110) follows directly

$$\Omega = (1 + TV)\,O, \quad OEO = O(E_0 + V + VTV)\,O. \qquad (116)$$
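The intermediate steps behind (116) are short; spelled out as a reading aid, and using only $OT = TO = 0$ from (106) together with $H_0 O = O H_0 = E_0 O$, they read:

$$\Omega = O + THO = O + T(H_0 + V)O = O + E_0\,TO + TVO = (1 + TV)\,O,$$

$$O(H + HTH)O = O H_0 O + OVO + O(H_0 + V)\,T\,(H_0 + V)O = O(E_0 + V + VTV)\,O.$$

In the second line the terms containing $O H_0 T = E_0\,OT$ and $T H_0 O = E_0\,TO$ vanish, so that only $OVTVO$ survives from the last product.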


Of particular interest is here the operator

$$t = V + VTV, \qquad (117)$$

which is called the reaction operator associated with the perturbation $V$, the
unperturbed Hamiltonian $H_0$, and the state under consideration. Using (116),
we obtain

$$E = E_0 + \langle\varphi_0|t|\varphi_0\rangle, \qquad (118)$$

showing that the expectation value of the reaction operator $t$ with respect to
the unperturbed state gives the true energy shift. Substitution into (117) gives
finally

$$t = V + V \frac{P}{E_0 + \langle\varphi_0|t|\varphi_0\rangle - H_0 - V}\, V, \qquad (119)$$

which is the basic formula for the reaction operator in our theory. There is
again an iterative element, which may be handled in the same way as before.
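The statement that the expectation value of the reaction operator gives the true energy shift can be checked on a matrix model. In the sketch below the unperturbed Hamiltonian is a diagonal matrix, the perturbation is a random symmetric matrix of modest size, and the exact perturbed eigenvalue is inserted into $T$; all of these are assumptions made for illustration, as are the variable names.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 5
H0 = np.diag(np.arange(n, dtype=float))      # unperturbed Hamiltonian; phi_0 is the first basis vector
B = rng.normal(size=(n, n))
V = 0.2 * (B + B.T) / 2.0                    # perturbation (illustrative strength)
H = H0 + V

E0 = H0[0, 0]                                # unperturbed level
E = np.linalg.eigvalsh(H)[0]                 # exact perturbed level

# T = P/(E - H): inverse of the (bb) corner of (E - H), zero in the reference row and column
T = np.zeros((n, n))
T[1:, 1:] = np.linalg.inv(E * np.eye(n - 1) - H[1:, 1:])

t_op = V + V @ T @ V                         # reaction operator, cf. (117)
shift = t_op[0, 0]                           # its expectation value in the unperturbed state

print(E - E0, shift)
```

In an actual calculation $E$ is not known beforehand, so the evaluation of the reaction operator and of the energy would have to be iterated together.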

It would be tempting to comment on the linked-cluster expansion and related
problems on the basis of this formula, but it would take us too far in this
connection, and instead we would like to refer to some forthcoming publications. The
essential thing for the moment is that the exact reaction operator has been
defined.


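Self-Consistent-Field Theories. - In order to review some of the common
features of the SCF-theories, we will consider a total many-particle Hamiltonian
of the form

$$H_{op} = H_{(0)} + \sum_i H_i + \sum_{i<j} H_{ij} + \sum_{i<j<k} H_{ijk} + \ldots \qquad (120)$$

Here $H_{(0)}$ is a constant, which may be of importance from the point of view of
convergence 10, 15) but which plays no role in the interaction between the particles,
so that it may temporarily be omitted. Let us divide this Hamiltonian into two
parts $H_{op} = H_0 + V$ where

$$H_0 = \sum_i (H_i + u_i), \qquad (121)$$

$$V = \sum_{i<j} H_{ij} + \sum_{i<j<k} H_{ijk} + \ldots - \sum_i u_i, \qquad (122)$$

and $u_i$ are one-particle potentials at our disposal. The eigenvalue problem
connected with $H_0$ is separable, and we obtain

$$\varphi_0 = \psi_1(x_1)\,\psi_2(x_2) \cdots \psi_N(x_N), \qquad (123)$$

where

$$(H_i + u_i)\,\psi_i = \varepsilon_i \psi_i, \qquad (124)$$

$$E_0 = \sum_i \varepsilon_i. \qquad (125)$$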


At first, we will leave the antisymmetry requirement aside. In the
so-called Hartree scheme, the total wave function is actually approximated by the
simple product (123). The best one-particle functions $\psi_i$ are determined by the
variation principle $\delta\langle H_{op}\rangle = 0$, which leads to Hartree equations of type (124),
with Hartree potentials given by the following expressions:

$$u_i = u_i^{(2)} + u_i^{(3)} + \ldots, \quad u_i^{(2)} = \sum_{j \neq i} \langle\psi_j|H_{ij}|\psi_j\rangle, \quad u_i^{(3)} = \sum_{j<k;\; j,k \neq i} \langle\psi_j\psi_k|H_{ijk}|\psi_j\psi_k\rangle, \;\ldots \qquad (126)$$

where the upper index $k$ indicates the order of the interaction term in the
Hamiltonian, from which the effective potential has been derived. For the total
energy, one obtains

$$\langle H_{op}\rangle = \sum_i \langle\psi_i|\, H_i + \sum_{k \geq 2} \frac{1}{k}\, u_i^{(k)} \,|\psi_i\rangle, \qquad (127)$$

which means that $\langle H_{op}\rangle$ is not identical with $E_0$; actually the factor
$(1/k)$ connected with $u_i^{(k)}$ prevents the $k$-body interaction to be counted $k$
times as it would be in $E_0 = \sum_i \varepsilon_i$.
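The counting argument can be made explicit in the simplest case. Assume, purely for illustration, that only two-body interactions are present, that the Hartree potential is $u_i = \sum_{j \neq i} \langle\psi_j|H_{ij}|\psi_j\rangle$, and write $J_{ij} = \langle\psi_i\psi_j|H_{ij}|\psi_i\psi_j\rangle$ (a shorthand introduced here). Then

$$E_0 = \sum_i \varepsilon_i = \sum_i \langle\psi_i|H_i|\psi_i\rangle + \sum_i \sum_{j \neq i} J_{ij} = \sum_i \langle\psi_i|H_i|\psi_i\rangle + 2\sum_{i<j} J_{ij},$$

$$\langle H_{op}\rangle = \sum_i \langle\psi_i|H_i|\psi_i\rangle + \sum_{i<j} J_{ij} = E_0 - \sum_{i<j} J_{ij}.$$

The sum of the one-particle energies thus counts every pair interaction twice, and the factor $1/2$ attached to the two-body potential removes exactly this double counting.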

In addition to $\varphi_0$, we will consider the "singly excited" function
$\varphi'$, which is obtained from $\varphi_0$ by replacing one (and only one) of the
functions $\psi_k$ by another $\psi_k'$ which is assumed to be orthogonal to the former, so
that $\langle\psi_k'|\psi_k\rangle = 0$. Using (122) and (126), one obtains directly

$$\langle\varphi'|V|\varphi_0\rangle = 0, \qquad (128)$$

which is a form of Brillouin's theorem, saying that all matrix elements of the
perturbation $V$ between the basic function $\varphi_0$ and all singly excited
functions will vanish identically. Since $V = H_{op} - H_0$, one gets further

$$\langle\varphi'|H_{op}|\varphi_0\rangle = 0. \qquad (129)$$

We note that this relation does not prevent the singly excited functions to appear
in the expansion of the exact solution, since they may come in through couplings
with terms which are at least doubly excited.

After this introduction, we will discuss the exact SCF-theory connected
with the product (123). For this purpose, we will assume that we have the
potentials $u_i$ at our disposal and introduce the projection operator $O$ connected
with $\varphi_0$ and $E_0$. According to (118), the exact energy is now given by the
expression $E = E_0 + \langle\varphi_0|t|\varphi_0\rangle$ where the reaction operator $t$ is defined
by (119). It must be possible to write $t$ in the form

[Equation (130) is illegible in the source.]

where we have separated out the one-particle part and denoted the
interaction part by $\tau$; the latter consists of a two-particle term, a three-particle
term, etc. The total energy can now be written in the form

[Equation (131) is illegible in the source.]

This expression is, in principle, exact and cannot be improved by variation.
However, in order to get a connection with the Hartree-scheme, we will now
remove the coupling between $\varphi_0$ and $\tau$ and consider $\tau$ as a fixed given
operator. The expression (131) is then no longer invariant, and the best function
$\varphi_0$ is determined by equations of type (123) and (124) with potentials $u_i$ given
by the conditions:

[Equation (132) is illegible in the source.]

i.e. exactly the same relations as (126) but with the interaction terms from the
Hamiltonian replaced by the reaction terms from $\tau$. This gives finally

[Equation (133) is illegible in the source.]

in complete analogy with (127).

The SCF-potentials are here considerably more complicated than in
the Hartree-scheme, but the energy (133) is also the true energy containing all
correlation effects. They may be calculated by a SCF-procedure based on the
following "flow diagram":

[The flow diagram is not reproduced in the source.]

Each cycle is here more complicated than the corresponding cycle (5), since it
involves the evaluation of the reaction operator $t$. This step corresponds
actually to an exact solution of the Schrödinger equation, which it ought to be
sufficient to carry out only once. There exists hence probably a short-cut,
perhaps by means of the first-order density matrix, and research on this point
is in progress.

Instead of (128) in the Hartree scheme, one obtains here directly, for any
singly excited function $\varphi'$,

$$\langle\varphi'|t|\varphi_0\rangle = 0. \qquad (134)$$

This theorem has the important consequence that, if the exact wave function
$\Psi$ is expanded in terms of Hartree products built up from the basic orbitals
$\psi_1, \psi_2, \ldots \psi_N$ and their orthogonal complement, the leading term will be $\varphi_0$,
and the expansion will further contain only terms which are at least doubly
excited with respect to $\varphi_0$ 135). This theorem is of importance in calculating
expectation values of one-particle operators, and it gives a certain physical
significance also to the "model" function $\varphi_0$.

It is now possible to follow the line from Hartree by way of Brueckner
to the exact SCF-theory. Apparently, the degree of accuracy depends on how
one has approximated the interaction part $\tau$ of the reaction operator $t$, and
one has:

Hartree: $\tau \approx \sum_{i<j} H_{ij} + \sum_{i<j<k} H_{ijk} + \ldots$

Brueckner: $\tau \approx \sum_{i<j} t_{ij}$ (136)

Exact SCF-theory: $\tau = \sum_{i<j} t_{ij} + \sum_{i<j<k} t_{ijk} + \ldots$

Symmetry Requirements in SCF-Theories. - In discussing correlation effects,
the symmetry requirements are certainly highly important. In the theory of
fermions, the antisymmetry requirement connected with Pauli's exclusion
principle diminishes the original correlation error connected with the
Hartree-product by about 50 %, since it eliminates the main part of the correlation
error connected with particles having parallel spins. In Sec. 5, we have seen
that the proper use of spin projection operators for certain systems may remove
another 85 % of the correlation error associated with electrons having
antiparallel spins, so that actually only about 1/12 of the original error has to be
accounted for by real many-particle theory. Hence it is highly desirable to
incorporate the symmetry properties in the SCF-theories.

The antisymmetry property for fermions is easily included by means
of the antisymmetry projection operator:

$$O_{AS} = (N!)^{-1} \sum_P (-1)^p P, \qquad (137)$$

and, instead of the total Hilbert space spanned by the complete set $\{f_k\}$, we
will now consider only the antisymmetric subspace spanned by the subset
$\{O_{AS} f_k\}$. Instead of starting from the Hartree product (123), we will now
base our study on the corresponding Slater determinant.
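The projection character of the antisymmetry operator can be made concrete in a small product space. The sketch below builds the operator as a matrix for `N` particles that can each occupy `d` orthonormal one-particle functions; the sizes, the index conventions, and the helper names are assumptions made here. It checks that the matrix is idempotent and symmetric, and that its trace equals the number of independent antisymmetric products, "d choose N".

```python
import itertools
import math
import numpy as np

def permutation_sign(perm):
    """Parity (+1 or -1) of a permutation given as a tuple of indices."""
    perm = list(perm)
    sign = 1
    for i in range(len(perm)):
        while perm[i] != i:
            j = perm[i]
            perm[i], perm[j] = perm[j], perm[i]   # each swap is one transposition
            sign = -sign
    return sign

def antisymmetrizer(d, N):
    """Matrix of (N!)^(-1) * sum over P of (-1)^p P in the d**N dimensional product basis."""
    shape = (d,) * N
    O = np.zeros((d ** N, d ** N))
    for perm in itertools.permutations(range(N)):
        sign = permutation_sign(perm)
        for idx in itertools.product(range(d), repeat=N):
            permuted = tuple(idx[p] for p in perm)
            row = np.ravel_multi_index(permuted, shape)
            col = np.ravel_multi_index(idx, shape)
            O[row, col] += sign
    return O / math.factorial(N)

d, N = 4, 3
O_AS = antisymmetrizer(d, N)

err_idempotent = np.linalg.norm(O_AS @ O_AS - O_AS)
err_symmetric = np.linalg.norm(O_AS - O_AS.T)
rank = int(round(np.trace(O_AS)))            # dimension of the antisymmetric subspace

print(err_idempotent, err_symmetric, rank, math.comb(d, N))
```

The reduction from `d**N` product functions to `math.comb(d, N)` antisymmetric ones shows how strongly the subspace is narrowed by the symmetry requirement.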

The Hartree-Fock scheme is characterized by potentials of the type
(126), but the interaction terms are now multiplied by reduced
antisymmetrization operators, so that

[Equation (138) is illegible in the source.]

and this introduces an essential simplification in the definitions of the
Hartree-Fock potentials $u_i$, since one can now take away the restrictions $j \neq i$,
$j \neq k \neq i$, ... in (126) and sum over all indices. This implies that the
Hartree-Fock potentials will be the same for all particles, and that these potentials are
conveniently expressed in terms of the fundamental invariant $\rho$ defined by (4).


In the exact SCF-theory, we can now confine our interest to the
antisymmetric subspace alone, and, within this subspace, we can now repeat the
partitioning procedure and evaluate the corresponding reaction operator $t$.
It appears that the previous reaction terms $t_{ij}$, $t_{ijk}$, ... will be modified
according to (138), so that one can remove the summation restriction in (132)
and base the entire discussion on the fundamental invariant $\rho$. In this
respect, the introduction of the exchange terms simplifies the structure of the
SCF-theory.


In Sec. 2b, we studied the consequences of the translational symmetry of a
crystal, and the same type of discussion can now be repeated here. It turns
out that the basic spin-orbitals should be Bloch functions, that the
fundamental invariant $\rho$ has the translational symmetry (31), and that
these properties are self-consistent and lead to an exact wave function which
is an eigenfunction of the total translations. This means that the important
concepts connected with the space of the reduced wave vector $\mathbf{k}$ in
the one-electron model will keep a certain meaning also in the exact
many-electron theory, and many of the semi-empirical discussions and
interpretations carried out with the aid of these concepts may hence have a
deeper validity than one could expect on the basis of the Hartree-Fock scheme
alone. The aim of this approach is hence to give a full justification of band
theory within the exact many-electron theory.
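The statement that Bloch functions are eigenfunctions of the translations can be checked numerically in the simplest possible setting. The following Python sketch is an illustration only and is not taken from the text: it assumes a ring of N equivalent sites with lattice constant 1, and the helper names `bloch_sum`, `translate` and `translation_eigenvalue` are hypothetical.

```python
import cmath


def bloch_sum(n_sites, m):
    """Coefficients of the Bloch sum with reduced wave number
    k = 2*pi*m/n_sites on a ring of n_sites sites (assumed toy model)."""
    k = 2.0 * cmath.pi * m / n_sites
    norm = n_sites ** -0.5
    return [norm * cmath.exp(1j * k * site) for site in range(n_sites)]


def translate(coeffs):
    """Translate by one lattice constant: (T c)[site] = c[site - 1], cyclically."""
    n = len(coeffs)
    return [coeffs[(site - 1) % n] for site in range(n)]


def translation_eigenvalue(coeffs):
    """Return the common ratio (T c)[s] / c[s] if it is the same on every site,
    i.e. if c is an eigenvector of the translation; otherwise return None.
    Assumes that no coefficient is zero."""
    shifted = translate(coeffs)
    ratios = [t / c for t, c in zip(shifted, coeffs)]
    if all(abs(r - ratios[0]) < 1e-12 for r in ratios):
        return ratios[0]
    return None


if __name__ == "__main__":
    N = 6
    for m in range(N):
        k = 2.0 * cmath.pi * m / N
        # The eigenvalue found numerically is compared with exp(-i k).
        print(m, translation_eigenvalue(bloch_sum(N, m)), cmath.exp(-1j * k))
```

For every allowed value $k = 2\pi m/N$ the ratio is the same on all sites and equals $e^{-ik}$, whereas a generic coefficient vector is not an eigenvector and the helper returns `None`.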

In conclusion, let us assume that there exists another normal constant of
motion $\Lambda$, which commutes with $\mathcal{H}$ and with $O_{AS}$, say the
total spin ($S^2$, $S_z$). By introducing the associated set of projection
operators of e.g. type (81), one can now split the antisymmetric basis into a
series of subsets, one for each eigenvalue of $\Lambda$. We can now confine our
interest to one of these subspaces, which is entirely independent of all the
other subspaces, being not only orthogonal but also non-interacting with
respect to $\mathcal{H}$ and $\Lambda$. Within this subspace, we can now carry
out our partitioning procedure, evaluate the reaction operator $t$, and
construct an exact SCF-theory based on a fundamental invariant $\rho$. This is
apparently a generalization of the extended Hartree-Fock scheme discussed in
Sec. 5 to an exact form. It has already been emphasized that the main part of
the correlation error affecting the original Hartree scheme is removed by an
inclusion of the symmetry requirements through the projection operator
technique, and only a comparatively small part of the correlation error has
then to be treated by true many-particle theory, i.e. by a study of the
reaction operator.
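The splitting of a basis into orthogonal and non-interacting subspaces can be illustrated for two spins 1/2. The sketch below is a toy model under stated assumptions, not the author's calculation: it takes the projection operators in the product form $O_k = \prod_{l \neq k} (\Lambda - \lambda_l)/(\lambda_k - \lambda_l)$, which we assume is the type meant by (81), uses $\Lambda = S^2 = 1 + P_{12}$ (units $\hbar = 1$), and a hypothetical Hamiltonian $J\,\mathbf{S}_1\cdot\mathbf{S}_2 + B\,S_z$ with arbitrarily chosen J and B.

```python
def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def lincomb(ca, a, cb, b):
    """Return the matrix ca*a + cb*b."""
    n = len(a)
    return [[ca * a[i][j] + cb * b[i][j] for j in range(n)] for i in range(n)]


def max_abs(a):
    return max(abs(x) for row in a for x in row)


def projector(op, eigenvalues, target):
    """Product-form projector onto the eigenspace of `op` belonging to `target`:
    O = prod over l != target of (op - l) / (target - l)."""
    n = len(op)
    result = identity(n)
    for lam in eigenvalues:
        if lam != target:
            scale = 1.0 / (target - lam)
            factor = lincomb(scale, op, -lam * scale, identity(n))
            result = matmul(result, factor)
    return result


# Basis order: |aa>, |ab>, |ba>, |bb>  (a = spin up, b = spin down).
P12 = [[1.0, 0.0, 0.0, 0.0],
       [0.0, 0.0, 1.0, 0.0],
       [0.0, 1.0, 0.0, 0.0],
       [0.0, 0.0, 0.0, 1.0]]
S2 = lincomb(1.0, identity(4), 1.0, P12)       # S^2 = 1 + P12, eigenvalues 0 and 2
SZ = [[1.0, 0.0, 0.0, 0.0],
      [0.0, 0.0, 0.0, 0.0],
      [0.0, 0.0, 0.0, 0.0],
      [0.0, 0.0, 0.0, -1.0]]

O_SINGLET = projector(S2, [0.0, 2.0], 0.0)
O_TRIPLET = projector(S2, [0.0, 2.0], 2.0)

J, B = 0.7, 0.3                                 # arbitrary illustrative parameters
S1S2 = lincomb(0.5, P12, -0.25, identity(4))    # S1.S2 = (P12 - 1/2) / 2
H = lincomb(J, S1S2, B, SZ)                     # commutes with S^2

if __name__ == "__main__":
    print("max |O_s O_t|   =", max_abs(matmul(O_SINGLET, O_TRIPLET)))
    print("max |O_s H O_t| =", max_abs(matmul(matmul(O_SINGLET, H), O_TRIPLET)))
```

Both printed quantities vanish: the two subspaces are orthogonal, and the Hamiltonian has no matrix elements between them, so each subspace can be treated separately.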

The relation between the various types of SCF-schemes has been sketched in
Fig. 5.


7.  CONCLUDING  REMARKS 

The goal of the many-electron theory is to express the exact wave function in
a simple form, e.g. in terms of an expansion which is as rapidly convergent as
possible and which contains a dominant term which has a simple physical
interpretation. There are particularly four forms which have been used so
far:$^{139)}$


139) P.O. Löwdin, Revs. Modern Phys. 32, 328 (1960).


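$$\Psi = \sum_K C_K D_K , \qquad \Psi = \sum_K C_K\, \mathcal{O} D_K , \tag{139}$$

$$\Psi = g \sum_K C_K D_K , \qquad \Psi = g \sum_K C_K\, \mathcal{O} D_K . \tag{140}$$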


Here the first form is an expansion in terms of Slater determinants $D_K$
based on one-electron functions, the second an expansion in terms of
projections of determinants $\mathcal{O}D_K$, whereas the two last forms are
similar but contain a "correlation factor" $g = g(r_{12}, r_{13}, \ldots)$
which is a symmetric function of the coordinates. The correlation factor was
first introduced by Hylleraas$^{140)}$ and, in connection with crystal theory,
it has been pointed out by Krisement$^{141)}$ that the form $\Psi = gD$ is
closely connected both with Wigner's classical theory for the electrons in an
alkali metal and Bohm and Pines's plasma model.$^{142)}$ In the latter, the
correlation factor has the following form

(141)

and corresponds physically to the collective motions of the electrons;
$\mathbf{k}_c$ is the cut-off vector for the plasma oscillations and
$\omega_p$ is the plasma frequency. The collective behaviour should, of
course, come out as a result of the reaction operator formalism, and it should
be mentioned that this problem has recently been studied by Hubbard$^{143)}$
using infinite-order perturbation theory.

140) E.A. Hylleraas, Z. Physik 54, 347 (1929).
141) O. Krisement, Phil. Mag. 2, 245 (1957).
142) See D. Pines, Solid State Physics 1, 368 (Academic Press, New York
     1955), p. 391.
143) J. Hubbard, Proc. Roy. Soc. (London) A240, 539 (1957); A243, 336
     (1957); A244, 199 (1958).

Fig. 5. Schematic survey of the various SCF-schemes which may be utilized in
connection with the development of band theory.


We have here confined our interest to the stationary crystal states described
by the time-independent Schrödinger equation, but the basic problems in
crystal physics could, of course, also be treated by considering the
time-dependent wave equation:

$$-\frac{\hbar}{i}\,\frac{\partial \Psi}{\partial t} = \mathcal{H}\,\Psi . \tag{142}$$

This equation has a solution of the form $\Psi(t) = U(t,0)\,\Psi(0)$, where
the "evolution" operator $U$ is a unitary operator which may be treated by the
$\infty$-order perturbation theory systematized by the Feynman diagram
technique.$^{144)}$ This approach has not been discussed here at all, but it
should be mentioned that important work on the foundations of crystal theory
has recently been made along this line. Actually, Hubbard's treatment of the
collective motions mentioned above was based on the use of the diagram
technique.

144) R.P. Feynman, Phys. Rev. 76, 749, 769 (1949).
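The unitarity of the evolution operator and the relation $\Psi(t) = U(t,0)\Psi(0)$ can be made concrete for a two-level system. The following sketch is only an illustration under assumptions that are not in the text: a hypothetical real Hamiltonian with rows (e, v) and (v, -e), units with $\hbar = 1$, and the closed form $U = \cos(wt) - i \sin(wt)\,H/w$, which holds here because $H^2 = w^2$ with $w = \sqrt{e^2 + v^2}$.

```python
import math


def evolution_operator(e, v, t):
    """U(t, 0) = exp(-i H t) for the assumed 2x2 model H = [[e, v], [v, -e]].
    Requires w = sqrt(e**2 + v**2) to be non-zero."""
    w = math.hypot(e, v)
    c, s = math.cos(w * t), math.sin(w * t)
    return [[complex(c, -s * e / w), complex(0.0, -s * v / w)],
            [complex(0.0, -s * v / w), complex(c, s * e / w)]]


def apply(u, psi):
    """Apply a 2x2 operator to a two-component state."""
    return [u[0][0] * psi[0] + u[0][1] * psi[1],
            u[1][0] * psi[0] + u[1][1] * psi[1]]


def compose(u2, u1):
    """Operator product u2 * u1 (u1 acts first)."""
    return [[sum(u2[i][k] * u1[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def norm(psi):
    return math.sqrt(sum(abs(x) ** 2 for x in psi))


if __name__ == "__main__":
    e, v = 0.4, 0.9                      # arbitrary illustrative parameters
    psi0 = [1.0 + 0.0j, 0.0 + 0.0j]
    for t in (0.0, 0.5, 1.7):
        psi_t = apply(evolution_operator(e, v, t), psi0)
        print(t, norm(psi_t))            # the norm stays equal to 1
```

The norm of the state is conserved at all times, and two successive evolutions compose as $U(t_2)U(t_1) = U(t_1 + t_2)$, which is the property exploited when the operator is expanded to infinite order in a perturbation.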



In connection with the plasma model, it was also pointed out that there was a
short-range correlation effect in the form of a very efficient screening which
could most simply be described as a dielectric behaviour of the electrons.
This phenomenon and related problems have been particularly studied in the
so-called dielectric approximation.$^{145)}$ Lindhard derives the essential
features of this approach starting out simply from the time-dependent
SCF-equations, whereas later authors have often utilized the diagram technique
and the full $\infty$-order perturbation theory. This method has given
particularly important information as to how the electrons in a crystal behave
when a weak outer electromagnetic field is applied.

145) J. Lindhard, Kgl. Danske Videnskab. Selskab, Mat.-fys. Medd. 28, 8
     (1954); J. Hubbard, Proc. Phys. Soc. (London) A68, 976 (1955); and
     reference 143; P. Nozières and D. Pines, Phys. Rev. 109, 741, 762
     (1958); Nuovo Cimento 9, 470 (1958); J.J. Quinn and R.A. Ferrell,
     Phys. Rev. 112, 812 (1958); H. Ehrenreich and M.H. Cohen, Phys. Rev.
     115, 786 (1959); D.F. DuBois, Ann. Phys. 7, 174 (1959); 8, 24 (1959);
     A. Klein, Phys. Rev. 1136 (1959); J. Callaway, Phys. Rev. 116, 1368
     (1959); D.S. Falk, Phys. Rev. 118, 105 (1960); G.R. Pratt, Phys. Rev.
     462 (1960); F. Englert and R. Brout, Phys. Rev. 120, 1085 (1960); and
     others.

To an experimentalist, the recent development of the quantum theory of the
electronic structure of crystals may seem rather complicated, and the question
is whether one could find some form of simple connection between the
one-electron model and the exact many-electron theory which could be used in
interpreting experiments and constructing semi-empirical theories. In this
connection, we would like to direct the attention to the importance of the
natural spin-orbitals $\chi_k$, which diagonalize the first-order density
matrix

$$\gamma(x_1'|x_1) = N \int \Psi^*(x_1', x_2, \ldots, x_N)\, \Psi(x_1, x_2, \ldots, x_N)\, dx_2 \cdots dx_N , \tag{143}$$

so that

$$\gamma(x_1'|x_1) = \sum_k n_k\, \chi_k^*(x_1')\, \chi_k(x_1) . \tag{144}$$


It may be shown that, if the total wave function $\Psi$ is an eigenfunction of
the total translations, then the natural spin-orbitals are (or may be chosen
as) Bloch functions $\chi_\mu(\mathbf{k}, x_1)$ associated with the space of
the reduced wave vector $\mathbf{k}$, where we have put
$k = (\mu, \mathbf{k})$. Instead of (144), one obtains

$$\gamma(x_1'|x_1) = \sum_{\mu,\mathbf{k}} n_\mu(\mathbf{k})\, \chi_\mu^*(\mathbf{k}, x_1')\, \chi_\mu(\mathbf{k}, x_1) , \tag{145}$$

and the number of electrons associated with the point $\mathbf{k}$ may now be
defined by the expression

$$n(\mathbf{k}) = \sum_\mu n_\mu(\mathbf{k}) . \tag{146}$$

Within the framework of the exact many-electron theory, it is in this way
possible to construct a series of concepts which are connected with the points
in $\mathbf{k}$-space.

For the kinetic energy $T(\mathbf{k})$ associated with the point $\mathbf{k}$
one obtains for instance

$$T(\mathbf{k}) = \sum_\mu n_\mu(\mathbf{k}) \int \chi_\mu^*(\mathbf{k}, x_1)\, \frac{\mathbf{p}_1^2}{2m}\, \chi_\mu(\mathbf{k}, x_1)\, dx_1 , \tag{147}$$

and the "effective mass" $m^*(\mathbf{k})$ for the kinetic energy$^{146)}$
could then be defined by the expression

$$T(\mathbf{k}) = \ldots \tag{148}$$

146) Compare W. Kohn, Phys. Rev. 105, 509 (1957).


This approach gives hence certain features of the conceptual structure of the
theory but, of course, one does not obtain any quantitative results until one
knows the exact wave function $\Psi$ or the associated density matrices. From
the experimental point of view, it would be particularly important if one in
this way could construct a semi-empirical theory and avoid the formal solution
of the Schrödinger equation. The results obtained so far make it likely that
such a development may be quite possible.
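How natural orbitals and occupation numbers follow from a known wave function can be seen in the smallest non-trivial case. The sketch below is an illustration under assumptions of our own, not a result of this review: a two-electron singlet whose space part is $\sum_{ij} C_{ij}\,\phi_i(1)\phi_j(2)$ in two real orthonormal orbitals, with a real symmetric coefficient matrix C normalized so that the trace of $CC^T$ is 1. For this model the spinless first-order density matrix is $2CC^T$. All function names and numerical values are hypothetical.

```python
import math


def coefficient_matrix(c1, c2, theta):
    """C = R diag(c1, c2) R^T: a wave function whose natural expansion
    coefficients are c1, c2, written in a basis rotated by the angle theta."""
    r = [[math.cos(theta), -math.sin(theta)],
         [math.sin(theta), math.cos(theta)]]
    c = [c1, c2]
    return [[sum(r[i][k] * c[k] * r[j][k] for k in range(2)) for j in range(2)]
            for i in range(2)]


def density_matrix(cmat):
    """Spinless first-order density matrix gamma = 2 C C^T of the assumed model."""
    return [[2.0 * sum(cmat[i][k] * cmat[j][k] for k in range(2))
             for j in range(2)] for i in range(2)]


def natural_orbitals(gamma):
    """Diagonalize a real symmetric 2x2 matrix analytically.
    Returns [(occupation number, orbital coefficients), ...], largest first."""
    a, b, d = gamma[0][0], gamma[0][1], gamma[1][1]
    mean, diff = 0.5 * (a + d), 0.5 * (a - d)
    radius = math.hypot(diff, b)
    angle = 0.5 * math.atan2(2.0 * b, a - d)
    return [(mean + radius, (math.cos(angle), math.sin(angle))),
            (mean - radius, (-math.sin(angle), math.cos(angle)))]


if __name__ == "__main__":
    cmat = coefficient_matrix(math.sqrt(0.98), -math.sqrt(0.02), 0.3)
    for occupation, orbital in natural_orbitals(density_matrix(cmat)):
        print(occupation, orbital)
```

The occupation numbers add up to the number of electrons, here 2, and the natural orbitals are recovered as the rotated basis functions, independently of the basis in which the coefficient matrix was first written.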

For a period of about twenty-five years, band theory and the valence bond
method were applied to the problem of the electronic structure of crystals in
their original form. In this review, we have tried to sketch some of the fast
development which has occurred in this field during the last decade, the
refinement of the conceptual framework and the drive towards higher accuracy
in the solution of the Schrödinger equation. Many important results have been
obtained, and it seems safe to predict that, during the next decade, still
more fundamental results of importance for the understanding of the chemical
physics of crystals will be achieved.


ACKNOWLEDGEMENTS 

The author is greatly indebted to F.K. Anders Fröman and F.M. Jean-Louis
Calais for valuable assistance in going through the recent literature and
preparing the references.


